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Abstract 

We investigate the constrained capacity of multiple-antenna fading coherent channels, where the receiver knows 
C/2 ■ the channel state but the transmitter knows only the channel distribution, driven by arbitrary equiprobable discrete 

inputs in a regime of high signal-to-noise ratio (snr). In particular, we capitalize on intersections between information 
theory and estimation theory to conceive expansions to the average minimum-mean squared error (MMSE) and 
the average mutual information, which leads to an expansion of the constrained capacity, that capture well their 
behavior in the asymptotic regime of high snr. We use the expansions to study the constrained capacity of various 
multiple-antenna fading coherent channels, including Rayleigh fading models, Ricean fading models and antenna- 
correlated models. The analysis unveils in detail the impact of the number of transmit and receive antennas, transmit 
and receive antenna correlation, line-of-sight components and the geometry of the signalling scheme on the reliable 
£N) . information transmission rate. We also use the expansions to design key system elements, such as power allocation 

and precoding schemes, as well as to design space-time signalling schemes for multiple-antenna fading coherent 
channels. Simulations results demonstrate that the expansions lead to very sharp designs. 
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I. Introduction 

In recent years, the surge of interest in multiple-antenna communications systems has been due to the realization 
that the capacity of the canonical independent and identically distributed (i.i.d.) Rayleigh fading coherent channel, 
where the receiver knows the exact channel state but the transmitter knows only the channel distribution, scales 
as Q, m, El: 

C(snr) = min(n t , n r ) log(snr) + 0(1) (1) 

so that, at high signal-to-noise ratio (snr), the potential gain in reliable information transmission rate of a multiple- 
antenna over a single-antenna system grows (linearly) with the minimum of the number of transmit or receive 
antennas, min(n t , n r ), a quantity naturally known as the multiplexing gain or the degrees of freedom. 

Of particular relevance has also been the characterization of the capacity of multi-antenna communications systems 
that embody prominent channel features that go beyond the canonical model. The crux of the characterizations, 
which for analytical tractability have been usually pursued in the asymptotic regimes of low- and high-snr, are 
affine expansions of the capacity in terms of fundamental performance measures. At low-snr, in order to capture 
the tradeoff between rate, bandwidth and power, it is appropriate to expand the capacity as an affine function of 
||| dB as follows H, 0: 

C [Wo) - S ° ' [si -YW- ) + °{n- - W J ~ So • 1082 U / Nod (2) 

where ^| dB is the (transmitted) energy per information bit in dB, | dB is the minimum (transmitted) 

energy per information bit . required for reliable communication in dB, and So is the capacity slope therein 
in bit/s/Hz/(3 dB). At high-snr, the capacity is expanded as an affine function of snr|dB as follows (6], Q: 

C(snr) = -cj\+ o(l) (3) 

where snr|dB denotes snr in dB, Soo denotes the high-snr slope in bits/s/Hz/(3 dB) and denotes a zero-order term 
or a power offset in 3-dB units with respect to a reference channel with the same high snr slope but with unfaded and 
orthogonal dimensions whose expansion intersects the origin at snr|dB = 0. Since the low- and high-snr quantities, 
Sq and as well as S^ and Coc are a function of the random channel matrix, recourse to the elegant theory 

iv o min 

of random matrices (8} has disclosed the influence of various factors such as the number of transmit and receive 
antennas, antenna correlation, antenna polarization, line-of-sight components, spatially colored noise, interference, 
or even key signal features, on the reliable information transmission rate |4|, 0, Q, 0, iflOl . ifTTI . Ifl2l . offering a 
more realistic view of the potential of multiple-antenna communications. In general, the channel capacity, by virtue 
of the well-known logdct(-) expression (3), depends ultimately on the distribution of the eigenvalues of certain 
random matrices linked to the random channel matrix, a distribution which is known in some settings and in some 
asymptotic regimes (HJ. Consequently, other characterizations valid for general snr, rather than only asymptotic snr 
regimes, have also been pursued. 
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It is well known, assuming that the channel variation over time is stationary and ergodic, that the capacity of the 
multi-antenna fading coherent channel is achieved by using (complex) Gaussian inputs, i.e., by using codewords 
whose elements are drawn from a zero-mean circularly symmetric complex Gaussian distribution that satisfy a 
transmit power constraint. However, it is also very relevant both from the theoretical and perhaps more importantly 
the practical perspective to study the constrained capacity of multi-antenna fading coherent channels driven by 
arbitrary (discrete) inputs. This is due to the fact that practical constraints pertaining to the transmission and 
reception of information often dictate the use of discrete inputs, such as PSK or QAM constellations, in lieu of the 
ideal Gaussian ones. Such a study, in addition to a deeper understanding, could also offer concrete guidelines for 
signal and system design. 

The characterization of the constrained capacity of systems driven by non-Gaussian inputs poses a myriad of 
challenges due to the absence of explicit and tractable mutual information expressions. A innovative approach 
towards the resolution of this class of problems was put forth in fPH . by exploiting connections between key 
quantities in information theory and estimation theory (T4), lfT31 . By drawing upon the I-MMSE identity [ 14], 
Lozano et al. [1131 have studied optimal power allocation policies for parallel non-interfering Gaussian channels 
with arbitrary inputs. Perez-Cruz et al. [16) and Parayo and Palomar ifTTl , |[T8l have in turn studied optimal power 
allocation and precoding for (interfering) multiple-input multiple-output (MIMO) Gaussian channels with arbitrary 
inputs (see also |fl9l , ||20l . lETl . |22l . ]23l ). Optimum power allocation for multiuser OFDM systems with arbitrary 
signal constellations, both in scenarios where the transmitter knows the fading channel state and in scenarios where 
the transmitter knows only the fading channel distribution, has been addressed using identical techniques in l24l . 

This paper pursues the characterization of the reliable information transmission rate of multi-antenna fading 
coherent channels with arbitrary (discrete) inputs, by capitalizing on the intersections between information theory 
and estimation theory. The coherent channel model, where the receiver is assumed to know the exact channel state 
but the transmitter is only assumed to know the channel distribution, is a particularly relevant one because the 
use of bandwidth limited feedback channels between the receiver and the transmitter only enables the transmission 
of statistical, rather than instantaneous, channel state information in (fast fading) mobile communications systems. 
Due to the difficulty in constructing a general analytical characterization valid for all snr, the analysis focus on the 
key asymptotic regime of high snr in order to shed important insight about the fundamental communication limits. 
It is important to note, though, that the characterization of the reliable rate in coherent fading channels, where the 
receiver knows the exact channel state but the transmitter knows only the channel distribution, is considerably more 
complex than in MIMO Gaussian channels, where the (fixed) channel matrix is known exactly to the receiver and 
the transmitter (e.g., see ifTBI . Ifl7; l and [18]). The most important aspect relates to the fact that the reliable rate is 
defined by the average of the mutual information with respect to the channel matrix distribution, rather than the 
mutual information conditioned on the (fixed) channel matrix only, as in fl6l . ifPTl and Ill81 . Consequently, we 
use the I-MMSE identity as a platform to explore other key analysis techniques, most notably, the machinery of 
asymptotic analysis and expansions, that lead to the exposure of the behavior of the reliable rate of multi-antenna 
fading coherent channels driven by arbitrary (discrete) inputs in the asymptotic regime of high snr. This paves the 
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way to the characterization of the constrained capacity of key multi-antenna fading coherent channel models as 
well as the design of key system elements. 

A. Contributions 

This paper contains various original contributions, which include: 

• Analytic characterization of the constrained capacity of multi-antenna fading coherent channels driven by 
arbitrary discrete inputs in the asymptotic regime of high snr. The contribution reveals that, whilst at low-snr 
the capacity of a multiple-antenna coherent channel driven by Gaussian inputs and the constrained capacity 
of a multiple-antenna coherent channel driven by (proper complex) non-Gaussian inputs admit a similar 
characterization |4j, at high-snr the behavior of the reliable rate for systems driven by the capacity-achieving 
Gaussian inputs and for systems driven by arbitrary discrete inputs is radically different; the contribution 
also reveals that the asymptotic characterization of the constrained capacity depends on the distribution of 
certain quadratic forms in Gaussian random variables, rather than the distribution of the singular values or 
the eigenvalues of certain random matrices. This aspect facilitates considerably the characterization of the 
constrained capacity of multi-antenna fading coherent channels. 

• Analysis of the constrained capacity of various multi-antenna fading coherent channel models, including 
Rayleigh fading models, Ricean fading models and antenna-correlated models. The contribution emphasizes 
the impact of the channel properties as well as the signal geometry on the constrained capacity of multi-antenna 
fading coherent channels 

• Design of system elements for various multi-antenna fading coherent channel models. In particular, the 
contribution illustrates with some detail the design of power allocation and precoding procedures as well 
as the construction of space-time schemes for various models. 

In addition, the analysis also unveils intimate connections between the asymptotic behavior of key performance 
measures, namely, the average (non-linear) minimum mean-squared error, the average mutual information and the 
average error probability, whose interest may transcend the domain of application. 

B. Organization 

This paper is organized as follows: Section [TT] describes the multiple-antenna fading coherent channel model. 
Sections Hill concentrates on the construction of a high-snr expansion for the reliable information transmission rate 
- the constrained capacity - of multiple-antenna fading coherent channels driven by arbitrary discrete inputs. The 
application of the expansions in the characterization of the reliable rate of the multiple-antenna canonical i.i.d. 
Rayleigh fading coherent channel as well as other multiple-antenna coherent channels that incorporate a variety 
of features is considered in Sections [IV] and [VJ respectively. We study in detail the effect on the reliable rate 
of Rayleigh fading, Ricean fading, the number of transmit and receive antennas, antenna correlation, as well as 
the signal properties. In turn, the application of the expansions in the design of communication system elements is 
considered in Section [VI] In particular, we study power allocation in a bank of parallel independent fading channels, 
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power allocation and precoding in multiple-antenna fading channels, and space-time signal design. Section IVIII 
summarizes the main conclusions and contributions of the paper. 

C. Notation 

We use the notation: Boldface uppercase letters denote matrices (X), boldface lowercase letters denote column 
vectors (x), and italics denote scalars (x); the context defines whether the quantities are deterministic or random. 
The symbols I, and diag (di, d%, . . . , d n ) represent the identity matrix, the null matrix and a diagonal matrix 
with diagonal elements d\, d%, . . . , d„, respectively. The symbol represents a unit vector where the kth entry is 
equal to one and the other entries are equal to zero. The operators det (•), tr (•), rank (•), vec (•) and ||-|| represent 
the determinant, trace, rank, vectorization and Frobenius norm of a matrix, respectively. The operators ( ) T and 
(■y represent the transpose and the Hermitian transpose of a matrix, respectively. The operator E{-} represents 
the expectation operation. CM (//, S) denotes a circularly symmetric complex Gaussian random vector with mean 
[i and covariance S. 3?(-) and Sy (•) denote the real part and the imaginary part of a complex number. H (•) 
represents the entropy of a random variable or a random vector, H (-|-) represents the entropy of a random variable 
or random vector given another random variable or random vector and !(■;■) represents the mutual information 
between two random variables or random vectors. We also use the asymptotic notation: / (x) = O (g (x)) as x — > xq 

0. 



if liraj;-^,, 



/(*) 



< oo and f (x) — o (g (x)) as x ~ s- xq if \im x ^ Xo 

II. System Model 

We consider a fading channel with n t transmit and n r receive antennas which can be modeled as follows: 

y = Vsnr • Hx + n (4) 

for a single use of the channel, where y G C™ r represents the vector of complex receive symbols, x 6 C rlf represents 
the vector of complex transmit symbols, n ~ CAf (0, 1) 6 C" r is a random vector which represents the noise and 
H £ C Ur x n * is a random matrix which represents unit-power random channel gains between the various receive and 
transmit antennas, so that E {tr (Htf) } = n t n r . We take the input to conform to an equiprobable multi-dimensional 
constellation with cardinality M, i.e., x e {xi, X2, . . . , xm} and Pr(xi) = Pr(x2) = ■•• = Pi'(xm) = j^j, with 
S x = Ejxx^} = J_ • I, We also take the input, the noise and the channel to be independent. Therefore, the 
signal-to-noise ratio per receive antenna is given by: 

Eltr^HxHHx^)} E{tr(HHt)} 
SNR = snr ■ — ^ — ^ ^- = snr • — ^ — i ^- = snr (5) 

n r ntn r 

We consider a channel matrix that incorporates Rayleigh fading, Ricean fading as well as transmit and receive 
antenna correlation in a separable correlation model l25l . ll26l . J27), ll28l . given by: 



A'TT- Ho + V^Ti 



H =A/T^^ H o + \/T^^ e) i H w©T ( 6 ) 



where Ho € C" rX "' is a n r x n t deterministic matrix, H w € C™*" x "' is a n r x n t canonical complex Gaussian 
random matrix with independent zero-mean and unit-variance circularly symmetric complex Gaussian random entries 
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and ©t € C n * XTli an( j @ R g c^ir are nt x nt an( j nr x Ur unit-diagonal Hermitian positive semi-definite 
matrices with the correlation coefficients between the n t transmit and the n r receive antennas, respectively. The 
Ricean K-factor K corresponds to the ratio between the deterministic and random component energies. Since the 
term Ho corresponds to line-of-sight or diffracted components, Ho = a^a^ where the vectors ax € C" f and 
aR € C nr are associated with the transmit and receive array responses to a plane wave so that 1 1 a-i- 1 1 2 = n t 
and 1 1 aR. 1 1 2 = n r . Note that in channels with Rayleigh fading, where line-of-sight components are absent, K = 0, 
whereas in channels with Ricean fading, where line-of-sight components are present, K ^ 0; in addition, in channel 
models without transmit or receive antenna correlation @t = I and 0r = I and in channel models with transmit 
and receive antenna correlation ©t 7^ I and 0r 7^ I. 

We assume that the receiver knows the channel matrix realization and that the transmitter knows only the channel 
matrix distribution. We also assume that the sequence of random channel matrices over time is stationary and ergodic. 
Consequently, the constrained capacity, achieved by coding over multiple fading blocks, is given by: 

E H {I (x;Vsnr-Hx + n|H)} (7) 

where I (x; \/snr ■ Hx + n|H) is the mutual information between input vector x and the output vector y = ^/snr • 
Hx + n conditioned on a realization of the channel matrix H. The goal is to characterize, as well as optimize, the 
constrained capacity in the asymptotic regime of high snr. 

III. Characterization of the Constrained Capacity 

We provide a characterization of the constrained capacity of multiple-antenna fading coherent channels with 
arbitrary equiprobable discrete inputs in the regime of high snr. This characterization, which is the crux of the 
study of the effect of common channel parameters and models on the system performance as well as the design of 
key system elements in subsequent sections, is based on an asymptotic expansion of the constrained capacity that 
portrays its behavior as a function of key system parameters in the regime of high snr. 

The definition of the asymptotic behavior is based on the procedure: First, we consider lower and upper bounds 
to the MMSE associated with the estimation of the noiseless output given the noisy output of the channel model 
in |@), for a fixed channel matrix, given by: 

mmse (snr; H) = E |||Hx — HE {x| Vsnr • Hx + n} || 2 |h| (8) 

Second, we consider lower and upper bounds to the average value of the MMSE associated with the estimation of 
the noiseless output given the noisy output of the channel model in (0), for a random channel matrix, given by: 

mmse(snr) = E H {mmse (snr; H)} = E H [e j ||Hx - HE {x|\/snr • Hx + n} || 2 |h| j (9) 

We then consider upper and lower bounds to the mutual information between the input and the output of the channel 
model in |@}, for a fixed channel matrix, given by: 

I (snr; H) = I (x: \/snr ■ Hx + n|H) (10) 
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as well as upper and lower bounds to the average value of the mutual information between the input and the output 
of the channel model in (|4), for a random channel matrix, given by: 

T(snr) =E H {l(snr;H)} = Eh { I (x; y/sru ■ Hx + n| H) } (11) 

by capitalizing on the I-MMSE identity and counterparts lfl4l . lfl5l . Finally, we expose the asymptotic behavior 
as snr — > oo of the average value of the MMSE and the average value of the mutual information, which leads to 
the asymptotic behavior as snr — > oo of the constrained capacity, by capitalizing on the machinery of asymptotic 
analysis and asymptotic expansions ll29l . 

The bounds are expressed in terms of the squared Euclidean distance between the (noiseless) receive vectors 
Hxj and Hx 3 associated with the transmit vectors Xj and Xj given by: 

djj(H) = ||Hx 2 -Hxjll 2 (12) 

as well as their probability density functions p d 2 (djj) and the higher-order derivatives (djj) = d n (p d 2 (dj-)^ j d (djj 
1 2 

Let us consider the channel model in with a fixed channel matrix. It is possible to obtain lower and upper 
bounds to the MMSE by relying on the use of a genie based estimator and a (sub-optimal) Euclidean distance 
based estimator, respectively. The bounds, which we express in terms of the squared pairwise Euclidean distances 
djj (H) ,i 7^ j, rather than the squared minimum Euclidean distance d^ in (H) = min^ d? (H) are, as opposed to 
the bounds in 11161 . valid for all signal-to-noise ratios. This aspect is particularly relevant because the bounds to the 
average value of the MMSE follow from the bounds to the MMSE by averaging over the fading statistics. 

Lemma 1: The MMSE associated with the estimation of the noiseless output given the noisy output of the channel 
model in Q, for a fixed channel matrix, can be bounded as follows: 

mmsei_B (snr; H) < mmse (snr; H) < mmseuB (snr; H) (13) 

where the lower and upper bounds are given by: 



1 ™ , i / d?(H)snr . 

mmS6LB (5nr; H) = 4M(lvl3T) g g ^ (H) ' 2 ' 6rfC ( V 4 - ) (14) 



1 M M 1 I 

mmseuB (snr; H) = — ^ ^ df } (H) • - • erfc ^ 

i=i j=i \ 



'd? (H)snr 



(15) 



Proof: See Appendix A. ■ 

The lower and upper bounds to the mutual information are obtained by using the upper and lower bounds to the 
MMSE, respectively, in the integral form of the relationship between the mutual information and the MMSE given 
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by 031 Q 

I (snr; H) = log M — / mmse(£;H)d£ (16) 

•J snr 

The bounds, which we also express in terms of the squared pairwise Euclidean distances d? (H) , i ^ j, rather than 
the squared minimum Euclidean distance d^ in (H) = min^ d? (H) are, as opposed to the bounds in |[T6l . also 
valid for all signal-to-noise ratios. 

Lemma 2: The mutual information between the input and the output of the channel model in for a fixed 
channel matrix, can be bounded as follows: 

I lb (snr; H) < I (snr; H) < l UB (snr; H) (17) 

where the upper and lower bounds are given by: 

1 M M dS(H)s„r 

l LB (snr;H)=logM--^]r2-e-^- (18) 

i=i j=i 



1 M M 1 ( 

l UB (snr; H) = log M - £ £ - . erfc 

k ' i=l i=l \ 



fd? (H)snr 



(19) 



Proof: See Appendix B. ■ 

Let us now consider the channel model in (HJi with a random channel matrix. It is possible to obtain lower and 
upper bounds to the average value of the MMSE by averaging over the fading statistics the lower and upper bounds 
to the MMSE as follows: 

rtTrnseLB(snr) = Eh {mmseLB (snr; H)} = J mmsei_B (snr; H) • ph(H)<1H (20) 

mmseuB(snr) = E H {mmse UB (snr; H)} = J mmse UB (snr; H) • p H (H)dH (21) 

where pn (■) represents the distribution of H. The determination of insightful closed-form expressions for the 
integrals in d20b and ( f2TT > is not simple. Consequently, rather than attempt to solve the integrals, we will exploit 
results from asymptotic analysis to determine the asymptotic expansion of the integrals. The asymptotic expansions, 
which lead to considerable insight, are very useful in the regime of high snr. 

Lemma 3: Assume that (d^) ,n = 0, 1,2, . . . are continuous and integrable in [0,oo). Then, the average 
value of the MMSE associated with the estimation of the noiseless output given the noisy output of the channel 
model in (0), for a random channel matrix, can be bounded as follows: 

mmseLB (snr) < mmse (snr) < mmseuB (snr) (22) 

'This representation assumes that the matrix H is non-singular. If the matrix H is singular then I (snr; H) = H (Hx) — J^ r mmse (£; H) d^. 



where the asymptotic expansion as snr — > oo of the lower and upper bounds are given by: 

^ M M ^ 

o 

n=0 



1S e LB (snr) = ^ — l — ■ fc LB „ +1 • E&dft ) 

V 
/ 



. »=1 3=1 



snr 



1 

lV+3 



AT = 0,1, 



se UB (snr) = ^ — X — ■ k UBn+1 



n=0 



M M 



. i=l 3 = 1 
V 35« 



1 



snr 



W+3 



TV = 0,1, 



and 



fc LB„ 



; i if-; . 



n-4 n T(n + 3/2) 
2M (M - 1) ' r(n + 2) 

2 n-4 n T(n + 3/2) 



/tF r (n + 2) 



(23) 
(24) 

(25) 
(26) 



and r (•) is the Gamma function. 

Proof: See Appendix C. ■ 

The lower and upper bounds to the average value of the mutual information are obtained by using the upper and 
lower bounds to the average value of the MMSE, respectively, in the integral form of the relationship between the 
average mutual information and the average MMSE given by lfT3l P 

/•OO 

T(snr) = log M - / mrnse(£)d£ (27) 

J snr 

Once again, the determination of insightful closed-form expressions for the integrals is not simple. Consequently, 
rather than attempt to solve the integrals, we will also exploit results from asymptotic analysis to determine the 
asymptotic expansions of the integrals. 

Lemma 4: Assume that pty (d^) ,n = 0, 1,2, . . . are continuous and integrable in [0,oo). Then, the average 
value of the mutual information between the input and the output of the channel model in Q, for a random channel 
matrix, can be bounded as follows: 

h_B (snr) < T(snr) < T UB (snr) (28) 
where the asymptotic expansion as snr — > oo of the lower and upper bounds are given by: 

''mm ^ 



AT MM 

T LB (snr) = log M - £ ■ k' lBn+1 ■ ££^(0) 



n=0 



N 



. i=l j=l 



J 



n=0 



M M 



UB (snr) = logM - £ -Lj ■ k' UBn+1 ■ ££^(0) 



. i=l j=l 



o 



o 



snr 



N+2 



J 



1 



snr 



N+2 



N = 0,1,... 



AT = 0,1, 



(29) 



(30) 



2 This representation assumes that the matrix H is non-singular with probability equal to one. If the matrix H is not non-singular with 
probability equal to one then l(snr) = Eh {H (Hx|H)} — mmse(£)d£. 
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and 

2 4" r (n + 3/2) 
_ M 0F I> + 2) (31) 

k> 1 il r(n + 3/2) G2) 

fcuB - ~ 2M (M - 1) 0F r(n + 2) ^ 

and T (•) is the Gamma function. 

Proof: See Appendix D. ■ 

It is important to remark that, in order to determine the asymptotic expansions of the lower and upper bounds 
to the average value of the MMSE and the average value of the mutual information, we assume that the functions 
(^u) j " = 0,1,2,... are continuous and integrable in [0, oo). We verify the assumption for the most common 
fading channel models, including Rayleigh and Ricean fading models, in subsequent sections. 

The asymptotic expansions in Lemmas [3] and |4] are the basis of the characterization of the asymptotic behavior 
of the average value of the MMSE and the average value of the mutual information in the regime of high snr. Let 
us define the integer d > 1 as follows: 

{M M 
ne N " 1 H&d"^ ) ^° ) ( 33 ) 
i=l j=l " 

We now seek to define the rate at which the average value of the MMSE tends to its limit as snr — > oo as well 
as the rate at which the average value of the mutual information tends to its limit as snr — > oo, i.e., 

lim l0gM(snr) (34) 
log snr 



snr— too 



and 



log (log M -T(snr)) 
— lim — (35) 

snr->oo log Snr 

These rates are trivially bounded as follows: 

log mmse UB (snr) logmmse (snr) log mmse LB (snr) 
— lim < — lim < — lim (36) 

snr-s-oo logSnr snr-yoo logSnr snr-yoo logSnr 

log (log M — Tlb (snr)) log (log M - T (snr)) log (log M - T UB (snr)) 
— lim 1 : < — lim — : < — lim : (37) 

snr-s-oo log Snr snr-i-oo logSnr snr-s-oo logSnr 

By capitalizing on the asymptotic expansions embodied in Lemmas [3] and 0] it is possible to write as snr — > oo: 

/ mm \ 



and 



log mmseLB (snr) = log 



- (d + l)logsnr + C ( — ) (38) 



i=l 3=1 



snr 



3 We will see that the fact that ( ) , n = 1, 2, . . . may be discontinuous at zero is immaterial. 
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and 



so that 



and 



log mmseuB ( snr ) = log 



wEE^~ 1} (o) 



1=1 j=i 



- (d + 1) logsnr + — 
1 snr 



/ 



log (log M — I LB (snr)) = log 



M M 



log (log M - Iub (snr)) = log 



i=i 3=1 



M M 



V 



= 1 3=1 



d logsnr + — 
1 snr 



dlogsnr + O — 
1 snr 



/ 



lim 

snr— ¥ oo 



logmmse(snr) 
- lim 

snr-s-oo log Snr 

log (logM -T(snr)) 



lim 

snr— >oo 



log mmseLB (snr) 



log snr snr'-s-oo logsnr 

We also seek to define, in addition to the rates at which the average value of the MMSE and the average value of 
the mutual information tend to their infinite-snr values, a finer characterization of the high-snr asymptotic behavior. 
Towards this end, we define the quantities: 



log snr 
log (logM - T LB (snr)) 



logmmseuB (snr) 

lim = d + 1 

nr^oo logsnr 

log (logM -T UB (snr)) 



(39) 



(40) 



(41) 



(42) 



— lim 

snr— >oo 



logsnr 



= d (43) 



£d = limsup snr d+1 ■ mmse(snr) 



(44) 



e d = liminf snr d+1 • mmse(snr) 



and 



t d = limsup snr d • (log M — I (snr)) 



snr— ¥ oo 



= liminf snr d ■ (logM - I (snr)) 



(45) 



(46) 



(47) 



Note that these quantities, in view of the asymptotic expansions embodied in Lemmas [3] and [4] can be bounded as 
follows: 



MM MM 

fc L Bd • EE^ _1) (°) fc UBd • EE^ 1} (°) 



(48) 



i=l 3=1 



i=l 3=1 



and 



MM MM 

„(d- 1 )/'n\ ^ ^ tI / W Y^^Jd-l), 



*k -££*S ">) <^ < ^ < fc LBd -EEfS _1) (o) 

4 1 ' IJ 1 ' ' ' IJ 

i=l 3=1 i=l 3=1 

Note also that the bounds in d48T > and d49l are finite for key fading models (see Sections HVl and |VT>. 



(49) 
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The following Theorems, which are based on these considerations, characterize the asymptotic behavior of the 
average value of the MMSE and the average value of the mutual information in the regime of high snr. 

Theorem 1: Assume that pw (d?) , n = 0, 1, 2, . . . are continuous and integrable in [0, oo). Then, in the regime 
of high-snr the average value of the MMSE associated with the estimation of the noiseless output given the noisy 
output of the channel model in can be expanded as follows: 

mmse (snr) = e d (snr) • + O (^j+j ) (50) 

where ed (snr) is a piecewise infinitely differentiable function such that: 

MM MM 

fc L Bd ■ EE4j _1) (°) ^ e « ( snr ) ^ k ^ ■ E5>djj" (51) 

i—l j — 1 i—1 j — 1 

in the interval [snro,oo) for a sufficiently high value of snro and 

lim sup e d (snr) = ?d and liminf ed (snr) = e d (52) 

snr->oo snr-s-oo 

Proof: See Appendix E. ■ 

Theorem 2: Assume that p^ (d?) , n = 0, 1, 2, . . . are continuous and integrable in [0, oo). Then, in the regime 
of high-snr the average value of the mutual information between the input and the output of the channel model in 
(HJi can be expanded as follows: 

T(snr) = log M - e d (snr) • -L + O (53) 
where e d (snr) is a piecewise infinitely differentiable function such that: 

MM MM 

fc uB d • EE^ _1) (°) < ^ ( snr ) < fc L Bd • EEp2 _1) (°) ^ 

— ij — 

i=l 3=1 i=l j'=l 

in the interval [snro,oo) for a sufficiently high value of snro an d 

lim sup e d (snr) = e d and liminf e d (snr) = e d (55) 

snr->oo snr->oo 

Proof: See also Appendix E. ■ 
Note that if 

e d = e d = lim snr d+1 • mmse (snr) = e d (56) 

snr— > oo 

and 

e d = £a = lim snr d • (log M -T(snr)) = e d (57) 

snr — foo 

then the asymptotic expansions exposed in Theorems [T] and [2] reduce immediately to: 

snr d + l I snr 



mmse (snr) = e d • + ( ) (58) 
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and 

T(snr) = logM-ei~+of^— ) (59) 
snr d \snr d+i / 

Therefore, in view of d27b . d58l > and ( |59l it is possible to establish the relation: 

e' d = lim snr d ■ (log M — T(snr)) = - • lim snr d+1 • mmse (snr) = - ■ e<j (60) 

snr— >-oo d snr— ►oc d 

so that, as expected, the scaling constants that define the high-snr asymptotics of the average value of the MMSE 
and the average value of the mutual information are also related. Note also that the quantities e d and e' d represent a 
generalization of the MMSE dimension, which, when it exists, defines the high-snr asymptotics of the MMSE of a 
random variable observed in zero-mean unit-variance Gaussian noise 11301 . Numerical results suggest that the limits 
in (l56l l and (f5Tb exist for common multiple-antenna fading coherent channel models driven by arbitrary equiprobable 
discrete inputs, thereby justifying the use of the expansions in d58l l and ( 1591 to characterize the asymptotic behavior. 

It is also important to note that in general we can only bound e d and e' d as in (|48l l and d49l ), respectively, rather 
than compute their exact values. The upper and lower bounds to ed and e' d differ by a factor of 4 • (M — 1), and so 
become increasingly loose for multi-dimensional constellations with high cardinality. The lower bound to e d and the 
upper bound to e d , which are due to the genie based estimator, are considerably loose. In contrast, the upper bound 
to ed and the lower bound to e' d , which are due to the Euclidean distance based estimator, are considerably tighter. 
In general, the use of the bounds to e d and e' d in the expansions in d58l l and (|59l l, respectively, leads to bounds to 
the true asymptotic expansions. The bounds to the asymptotic expansions and the true asymptotic expansions differ 
only by an offset in signal-to-noise ratio. The offset in dB between the high-snr expansion of the average value of 
the MMSE in (EFl l and the expansions that use the lower and upper bounds to the value of ed is given respectively 
by: 



fc LB d ' Ei=l Ej=1 Pi? (0) 



A LB | HR = 10-log 10 I 



and 



"UB d 



•r-^M v-^M (d-l), m ' 

Ei=iEi=U»da (0) 



4 It is possible to construct genie based estimators that lead to tighter bounds. The current genie supplies the receiver with a pair of input 
vectors for each transmit vector, the true input vector and any of the other input vectors with equal probability. A more appropriate genie supplies 
the receiver with the true input vector and another suitable input vector for each transmit vector. In particular, in the canonical i.i.d. Rayleigh 
fading coherent channel, Theorem [5] suggests that the genie ought to minimize as much as possible the sum of the inverse of the squared 
Euclidean distances between the pairs of supplied input vectors. It is also important to guarantee, in addition, that such a genie construction 
leads indeed to a lower bound to the MMSE and hence an upper bound to the mutual information. This is met, for example, by constructing 
a genie that also ensures that the receiver sees all the input vectors with equal probability. We do not pursue this issue further, because the 
Euclidean distance based estimator upper bound to and lower bound to e' d are still tighter than the new genie based estimator lower bound 
to and upper bound to e' d . 
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TABLE I 

SlGNAL-TO-NOISE RATIO OFFSET BETWEEN THE TRUE ASYMPTOTIC EXPANSIONS AND THE BOUNDS TO THE ASYMPTOTIC EXPANSIONS 
FOR MULTIPLE- ANTENNA RAYLEIGH FADING CHANNELS DRIVEN BY 16-QAM INPUTS 



(nt,n r ) 


A lbL 


A UB d B 


A 'lbL 


A UB IdB 


(1,1) 


-6.9 dB 


2.0 dB 


3.9 dB 


-13.8 dB 


(1,2) 


-5.0 dB 


0.9 dB 


1.4 dB 


-7.5 dB 


(1,3) 


-4.0 dB 


0.5 dB 


0.7 dB 


-5.3 dB 


(2,2) 


-9.2 dB 


0.8 dB 


1.2 dB 


-13.8 dB 


(3,3) 


-9.8 dB 


0.8 dB 


1.0 dB 


-13.0 dB 



TABLE II 

SlGNAL-TO-NOISE RATIO OFFSET BETWEEN THE TRUE ASYMPTOTIC EXPANSIONS AND THE BOUNDS TO THE ASYMPTOTIC EXPANSIONS 
FOR SINGLE-ANTENNA RAYLEIGH FADING CHANNELS DRIVEN BY VARIOUS INPUTS 





A LB| dB 


A us| dB 


A L_B 1 dB 


A LIBldB 


BPSK 


-2.4 dB 


0.6 dB 


1.1 dB 


-4.9 dB 


QPSK 


-4.3 dB 


1.1 dB 


2.2 dB 


-8.6 dB 


16-QAM 


-6.9 dB 


2.0 dB 


3.9 dB 


-13.9 dB 


64-QAM 


-9.4 dB 


2.6 dB 


5.3 dB 


-18.7 dB 


256-QAM 


-11.7 dB 


3.3 dB 


6.6 dB 


-23.5 dB 



In turn, the offset in dB between the high-snr expansion of the average value of the mutual information in 
the expansions that use the lower and upper bounds to the value of e d is given respectively by: 



/ k 



a; 



LBldB 



10 • log 



10 



LB d 



l= iLi=iP d 



(d-1) 



(0) 



and 



(63) 



and 



/ fci 



a; 



UB I dB 



= 10 • lot 



10 



UB d 



i= i Li=ip d 





(d-i; 



(0) 



(64) 



V 



Tables Q] and [XT] show the signal-to-noise ratio offset values for particular system configurations. We observe that 
indeed AuB| dB or A' LB | dB are much lower than Alb^ or A' UB | dB , respectively, which is a manifestation of the 
fact that the upper bound to ed and the lower bound to e d are considerably tighter that the lower bound to e d and 
the upper bound to e' d . We also observe that the signal-to-noise ratio offset values AuB| dB and A' LB | dB decrease 
with the increase in the number of antennas and, as expected, increase with the increase in the multi-dimensional 
constellation cardinality. As an example, it is interesting to note that A UB | dB = 0.8 dB and A' LB | dB = 1.2 dB for 
a two-transmit two-receive antenna canonical i.i.d. Rayleigh fading coherent channel driven by 16-QAM inputs. 

Figures Q] [2] and [3] depict the average value of the MMSE and the average value of the mutual information and 
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the respective asymptotic expansions for key multiple-antenna systems and models. |f| We observe a very reasonable 
match between the exact curve, obtained through Monte Carlo simulations, and the asymptotic expansion in the 
high-snr regime. We also observe that the asymptotic expansions characterize perfectly the rates at which the average 
MMSE and the average mutual information approach their infinite snr values. In general, the use of the high-snr 
asymptotic expansions of the average MMSE and the average mutual information results in errors that relate to the 
fact that: z) one can only compute the leading term rather than other possible higher-order terms in the high-snr 
asymptotic expansions; and z'z) one can only compute analytically bounds to and e' d . The first consideration 
leads to the differences between the asymptotic expansions and the exact curve in Figures [TJ [2] and [3] The second 
consideration leads to the additional signal-to-noise ratio offsets in Tables Q] and [TT] Nonetheless, it will be shown 
that the asymptotic expansions still produce very accurate designs in Section |VI] 

The regime of validity of the asymptotic expansions in certain fading models also deserves more attention. The 
Ricean fading model is an intriguing case study. For example, it is well known that with the increase in the K- 
factor a Ricean fading channel will approach an AWGN-like channel, so that one would expect the average MMSE 
and the average mutual information to tend to their infinite-snr values at a much faster rate than d + 1 and d, 
respectively, as put forth in Theorems Q] and [2] Figures [4] and [5] overcome this apparent contradiction by showing 
that the average MMSE and the average mutual information do indeed tend to their infinite-snr values at rates 
d + 1 and d, respectively. However, the influence of such terms in the expansions only shows up at extremely high 
signal-to-noise ratios; other possible terms in the expansion - which mimic more closely the AWGN-like behavior - 
play a more prominent role at moderate and high signal-to-noise ratios. Obviously, in such circumstances, typically 
associated with larger K-factors, the expansions will be of little practical use because they fail to characterize the 
behavior of the quantities in regimes of interest. 

Another interesting issue relates to possible generalizations of the expansions in (|58T > and (I59t . In view of the 
expansions of the upper and lower bounds to the average value of the MMSE as well as the upper and lower 
bounds to the average value of the mutual information put forth in Lemmas [3] and |4] and the arguments that lead 
to Theorems Q] and [2] it is also tempting to conjecture that the average MMSE behaves as: 




(65) 



for arbitrary N, with 




(66) 



whereas the average mutual information behaves as: 




(67) 



5 We use the exact values of t A and e' d , obtained through Monte Carlo simulations, in the asymptotic expansions. 
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(c) 

Fig. l.lxl Rayleigh fading coherent channel with a 16-QAM input: a) average MMSE; b) average mutual information; (c) difference between 
maximum average mutual information and average mutual information. 
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(c) 

Fig. 2. 2 X 2 Rayleigh fading coherent channel with 16-QAM inputs: a) average MMSE; b) average mutual information; (c) difference between 
maximum average mutual information and average mutual information. 
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(c) 

Fig. 3. 3 X 3 Rayleigh fading coherent channel with 16-QAM inputs: a) average MMSE; b) average mutual information; (c) difference between 
maximum average mutual information and average mutual information. 
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(c) 

Fig. 5. 1 X 1 Ricean fading coherent channel with a 16-QAM input (K = 4, Ho = 1): a) average MMSE; b) average mutual information; (c) 
difference between maximum average mutual information and average mutual information. 
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for arbitrary N with 

fe-i 



lim snr fc ■ ( log M - T(snr) - V el ■ — ^ | (68) 



k'=d 



Expansions d65l l and d67b would then provide a dissection of the high-snr behavior of the average value of the 
MMSE and the average value of the mutual information of arbitrary equiprobable discrete inputs observed through 
a multiple-antenna fading coherent channel in Gaussian noise in terms of a series of quantities, namely, and e' k 

for k = d,d + 1, d + 2, Interestingly, though our analytic techniques do not lead to d65l l and d67b . Figures [6] 

and |7] show that such expansions provide a more accurate representation of the average value of the MMSE and 
the average value of the mutual information than the expansions in d58l and (1591 . 

Finally, it is also of interest to characterize the asymptotic behavior of the probability of error in a multiple- 
antenna fading coherent channel with arbitrary equiprobable discrete inputs in the regime of high snr. We consider 
upper and lower bounds to the (uncoded) probability of error associated with a maximum likelihood detector for 
the channel model in for a fixed channel matrix, given by: 

P e (snr; H) = Pr (x ML (V^ • Hx + n) ^ x|H) (69) 

as well as upper and lower bounds to the average value of the (uncoded) probability of error associated with the 
maximum-likelihood detector for the channel model in (@), for a random channel matrix, given by: 

P e (snr) = E H {P e (snr; H)} = E H {Pr (x ML {V^~r • Hx + n) ^ x|H) } (70) 

where xml (\/snr ■ Hx + n) corresponds to the maximum-likelihood detector estimate of the input given the output 
of the channel model in (0|. We also consider the asymptotic expansions of the upper and lower bounds to the 
average value of the probability of error. Lemmas [5] and [6] summarize the results. 

Lemma 5: The (uncoded) probability of error associated with maximum likelihood detection in the channel model 
in (0), for a fixed channel matrix, can be bounded as follows: 

P 6LB (snr; H) < P e (snr; H) < P euB (snr; H) (71) 

where the lower and upper bounds are given by: 



M M 



fd? (H)snr 



i=l j = l 



M M 



'd? (H)snr 



P euB (snr;H) = l^^i.erfc| 



i=l 3 = 1 



Proof: See Appendix F. 
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(b) 

Fig. 6. 1 X 1 Rayleigh fading coherent channel with a 16-QAM input : a) higher-order expansions of average MMSE; b) higher-order expansions 
of average mutual information. 
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(b) 

Fig. 7. 2 X 2 Rayleigh fading coherent channel with 16-QAM inputs : a) higher-order expansions of average MMSE; b) higher-order expansions 
of average mutual information. 
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Lemma 6: Assume that pw (d?) ,n = 0, 1,2, . . . are continuous and integrable in [0, oo). Then, the average 
value of the (uncoded) probability of error associated with maximum likelihood detection in the channel model in 
©, for a random channel matrix, can be bounded as follows: 



Pe LB (snr) < P e (snr) < P euB (snr) 
where the asymptotic expansion as snr — s- oo of the lower and upper bounds are given by: 



(74) 



snr = 



N 

E 

71=0 



snr 



n+l 



snr = 



N 1 
£-i snr n+1 



k" 



k" 

K UB„+i 



M M 



n=0 



i=l j=l 



M M 



i=l 3=1 



O 



O 



1 



r N+2 



1 



snr 



Af+2 



N = 0,1, 



N = 0,1, 



(75) 



(76) 



and 



k" 



1 

2M(M - 1) 



— — — ■ — 

K UB — — 



P r (n + 1/2) 
' T(n + 1) 

r(n+l/2) 



2M 



7T r(n + l) 



(77) 
(78) 



and T (•) is the Gamma function. 

Proof: See Appendix G. ■ 

Note that as snr — > oo the upper and lower bounds to the average value of the probability of error are arbitrarily 
tight so that: 



Pe(snr) = O 



1 



snr 



1 



snr 



(79) 



We immediately recognize the quantity d, which also appears in the expansions of the average value of the MMSE 
and the average value of the mutual information in Theorems Q] and |2j respectively, to be the familiar diversity gain 
given by (3TJ: 

log P e (snr) 



lim 

snr— > oo 



(80) 



log snr 

Interestingly, the high-snr asymptotic behavior of the average value of the minimum mean-squared error, the 
average value of the mutual information and also the average value of the probability of error exhibit some common 
features, with the value of the parameter d and the value of the bounds to ed (snr) and e' d (snr) dictated by the 
value of the probability density function of the squared pairwise Euclidean distances at zero, p d 2(0), or the value 
of their higher-order derivatives at zero, p.^(fi), n > 0. This dependency, which has also been very briefly noticed 



in I32I . is intuitive because one could argue that in fading channels at high-snr the quantities are affected primarily 
by the value of the probability of arbitrarily close (noiseless) receive vectors due to unfavorable fading realizations, 
in the same way that in non-fading channels at high-snr the minimum mean-squared error, the mutual information, 
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and the probability of error are primarily affected by the value of the Euclidean distance between the (noiseless) 
receive vectors lfl6l . As a side remark, we observe that the values of p d 2 (0) or pjg (0), n > 0, which depend on the 
channel statistics as well as the system elements, can then be the basis of the characterization or the optimization 
of the constrained capacity of multiple-antenna fading coherent channels driven by arbitrary equiprobable discrete 
inputs in the regime of high snr. Sections HVl IVl and I VII concentrate on such issues. 

IV. The Canonical i.i.d. Rayleigh Fading Coherent Channel 

It is now relevant to characterize the constrained capacity of the canonical i.i.d. Rayleigh fading coherent channel 
with arbitrary equiprobable discrete inputs in the regime of high-snr. The objective is to understand the effect on 
the constrained capacity of various system parameters, such as the number of transmit antennas, the number of 
receive antennas as well as the characteristics of the signalling scheme. 

The following Theorem unveils the high-snr behavior of the constrained capacity. We represent the squared 
pairwise Euclidean distance between two arbitrary transmit vectors x^ and Xj by d? = ||xj — Xj|| 2 . 

Theorem 3: In the regime of high-snr, the constrained capacity of the canonical i.i.d. multiple-transmit-multiple- 
receive antenna Rayleigh fading coherent channel with arbitrary equiprobable discrete inputs obeys: 

T(snr) = log M - e' (snr) • — -- + O ( ^—] (81) 

y J 6 rir x ' snr"- V snr "' / 

where 

M M / \ MM 

fc k-EE i <CW<^-EE(i] (82) 





and k' LB and k' UBn are given by OTT l and d32l . respectively. 

Proof: The basis of the proof is the calculation of the probability density function of the squared pairwise 
Euclidean distance between two arbitrary (noiseless) receive vectors given by: 

d 2 = tr (H w A y Ht) (83) 

where Ay = (x.; — Xj)(xi — XjY . Denote the eigenvalue decomposition of the positive semi-definite matrix Ay 
by Ay = WyAyWj-, where Wjj is a unitary matrix and Ay = diag(Ay,0, • • • , 0) is a diagonal matrix with 
a single non-zero diagonal element Ay = tr (Ay) ~ tr (Ay) = tr ((x^ — Xj)(x; — Xj)^) = ||(xj — Xj)|| 2 = djj 
(note that there is a single non-zero diagonal element Ay > because the matrix Ay is unit rank). It is possible 
to show (see 0) mat the distribution of 

tr (H w AyHt) = tr ( M W \V, ; A,,W;,]] W ) (84) 

is equal to the distribution of 

n r 

tr(H w AyHt) =51 Ay |6 1 2 (85) 



25 



where k = 1, . . . , n r , are independent circularly symmetric complex Gaussian random variables with zero-mean 
and unit-variance. This is a central chi-square (or gamma) distribution with 2n r degrees of freedom with probability 
density function given by DTI : 

Pd ^ = A^(n r -1)! ■ (d ^"" _1 ' 6 "^' ^ " ° (86) 
The high-snr expansion of the constrained capacity follows immediately from Theorem [2] by noting that the 
function x n r( * _iy ' (dy) r 1 e Xij and its higher-order derivatives are continuous and integrable on [0, oo), 
P f (0) = 0, k = 0, . . . , n r - 2, and p^ r ~ l) (0) = i + 0. ■ 

ij ij ^j 

Theorem [3] defines the impact of the number of transmit and receive antennas as well as the characteristics of the 
signalling scheme on the constrained capacity of the canonical i.i.d. Rayleigh fading coherent channel with arbitrary 
equiprobable discrete inputs, in the regime of high snr. In particular, in view of the bounds in d82t . this Theorem 
is consistent with the fact that constellations with poor sphere-packing properties require more signal-to-noise ratio 
than constellation with good sphere-packing properties in order to achieve a certain target constrained capacity in 
the regime of high snr. 

The number of receive antennas controls the rate at which the constrained capacity value tends to its infinite-snr 
value, so that a multiple-receive antenna system requires less signal-to-noise ratio than a single-receive antenna 
system in order to achieve a certain target constrained capacity in the regime of high snr. The number of transmit 
antennas, as opposed to its effect on the capacity of the canonical i.i.d. multiple-antenna Rayleigh fading coherent 
channel (O, does not affect directly the multiplexing ability of the system because the infinite-snr constrained 
capacity value depends solely on the number of constellation points or vectors. Instead, a higher-dimensional 
(complex) space, in comparison to a lower-dimensional one, enables the construction of more efficiently packed 
multi-dimensional constellations 031 . This, in view of the bounds in (1821 . suggests that a multiple-transmit antenna 
system requires potentially less signal-to-noise ratio than a single-transmit antenna system in order to achieve a 
certain target constrained capacity in the regime of high snr. Figures [8] and [9] which illustrate these aspects, also 
demonstrate that the asymptotic expansions capture this behavior of the constrained capacity in the regime of 
high-snr. 

Theorem [3] also suggests that appropriate constellations for communication over the canonical i.i.d. multiple- 
antenna Rayleigh fading coherent channel - in the sense of optimization of the constrained capacity - optimize the 
quantity: 

M M 

EE 

i=i i=i 

where d^ in = min^ djj represents the minimum squared Euclidean distance between pairs of constellation points 
or vectors and K represents the number of pairs of points or vectors with pairwise squared Euclidean distance 
equal to the minimum squared Euclidean distance, a quantity akin to the kissing number. Interestingly, a different 
compromise appears to exist between K and d^ in in canonical i.i.d Rayleigh fading coherent channels where only 
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Fig. 8. Comparison of constrained capacity for 1 X 1, 1 X 2 and 1x3 Rayleigh fading coherent channels with 16-QAM inputs. 
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Fig. 9. Comparison of constrained capacity for a 1 X 2 Rayleigh fading coherent channel with a 256-QAM input and a 2 X 2 Rayleigh fading 
coherent channel with 16-QAM inputs. 
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the receiver knows the channel state than in fading channels where both the transmitter and the receiver know 
the channel state. Note that, when the channel matrix is known to the transmitter and the receiver, the high-snr 
expansion of the constrained capacity depends exponentially on d^ in fl6l . 

V. Other Fading Coherent Channels 

It is also relevant to characterize the constrained capacity of non-canonical fading coherent channels with arbitrary 
equiprobable discrete inputs in the regime of high snr. We study models that arise in many practical scenarios such 
as: i) the antenna-correlated Rayleigh fading coherent channel; and it) the Ricean fading coherent channel, j] 

A. Antenna-Correlated Rayleigh Fading Channels 

We write A^- = (x,- — Xj)(x,- — Xj)t. We denote the eigenvalue decomposition of the positive semi-definite matrix 

a i i i i 
© T A,j0 T by T Ay0 T = Wxy Ax y W T , where Wxy is a unitary matrix and Ax y = diag^T^ , 0, . . . , 0) 

is a diagonal matrix with at most a single non-zero diagonal element Aiy > (note that there is at most a single 

i i 

non-zero diagonal element At^ > because the matrix T Ajj0 T is at most unit rank). We also denote the 
eigenvalue decomposition of the positive semi-definite matrix 0r by 0r = WrArW r , where Wr is a unitary 
matrix and Ar = diag(A/? t ,) is a diagonal matrix with diagonal elements Xn k > 0, k = 1, . . . , n r . 

The following Theorem, which represents a generalization of Theorem [3] defines the high-snr behavior of the 
constrained capacity of antenna-correlated Rayleigh fading coherent channels. The Theorem concentrates on non- 
degenerate scenarios where Aj^ > 0, V i ^= j, and Xji k > 0, V k. 

Theorem 4: In the regime of high-snr, the constrained capacity of the n r x n t antenna-correlated Rayleigh fading 
coherent channel with arbitrary equiprobable discrete inputs obeys: 

T(snr)=logM-e' ( sm ).J—+o(— ^- T ] (88) 



i=l =1 ^ AT *i / Rk I j=l ]=1 \ AT *i J \k=l Rk I 

and k' LBn and k' UBn are given by OTb and d32l , respectively. 

Proof: The squared pairwise Euclidean distance between two arbitrary (noiseless) receive vectors is given by: 

= tr(0 R H w T A lJ 0|Ht v ©|) (90) 

6 We consider for simplicity exclusively Ricean fading channels with no transmit or receive antenna correlation. 

7 The Theorem also considers only scenarios where the eigenvalues Xn k ,k = 1, ...,n r , are either all distinct or all equal (necessarily 
to one). The Theorem does not consider scenarios where there are groups of identical eigenvalues. This generalization requires considerable 
algebraic manipulation without adding much relevant insight. Note that \r 1 = \r 2 = • • ■ = Xji n = 1 corresponds to a scenario where the 
receive antennas are uncorrelated. 



where 
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It is possible to show (see |3'0 that the distribution of 

tr(e|H w eJ AyejHtei) = tr (WrAIw^Wt/^W;, HtW R A| W^) (91) 
is equal to the distribution of 

tr (A|H W A Tij Hi A|) = >T A Ty A flfc |& 1 2 (92) 



fe=i 

where fc = 1, . . . , n r , are independent circularly symmetric complex Gaussian random variables with zero-mean 
and unit- variance. 

Assume that the eigenvalues of the receive correlation matrix are all equal (necessarily to one), i.e., A^ = \r 2 = 
■ • • = A/f n = 1. Then, the probability density function of djj is given by PD : 

^-^(Z-l)!-^""'-*"^ d »-° <93) 

The high-snr characterization of the constrained capacity follows from Theorem [2] by noting that 

p«(0)=0, fc = 0,...,n r -2 (94) 

and 

P^ 1} (0)-t^-^0 (95) 

" Tij 

Assume now that the eigenvalues of the receive correlation matrix are all distinct, i.e., ^ \r 2 ^ ■ ■ ■ ^ Xr„ . 
Then, the probability density function of is given by 1341 : 

**<® X x . ' <\,; . \ ^ 4>0 (96) 

k=i X Tij X Rk [[ k '=l i 



A 



k'^k v 

The high-snr characterization of the constrained capacity also follows from Theorem |2j by noting that 

p§\0) = 0, fe = 0,...,n r -2 (97) 

u 

and 

^r" 1) (0) = I 4--nT L ^ C98) 
It is also simple to show that the conditions for the application of Theorem [2] are satisfied, i.e., the functions: 

1 1 -J- 

'd® 1 * -e Ar - (99) 



and 



J2 n 7 aTTY' 6 XTijXR " (100) 

fe=i A Ti3 A flfc IlfcLi ( 1 - "Ar' ' 



and their higher-order derivatives are continuous and integrable on [0, oo). 
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It is also instructive to examine the implications of degenerate conditions where At ;j = for some values of i and 
j and \n k = for some values of k. Assume that the receive correlation matrix ®r has n' non-zero eigenvalues 

An k > and n" = n r — n' zero eigenvalues Xn k = 0, so that there are n" perfectly correlated paths. Assume also 

i i 

that some of the matrices 0^Ay0j have eigenvalue Ar y = 0, so that the vector Xj — Xj lies in the null space of 
the transmit correlation matrix ©t- This is also equivalent to saying that the receiver cannot distinguish some of 
the transmit vectors. By letting logM' = H ^©^H w 0^,x|H w ^ < H (x) = log M , a simple modification of the 
Theorems reveals that as snr — > oo the constrained capacity behaves as follows: 

T(snr) = log M' - e' n , (snr) ■ -L + Q ( —^-) (101) 

snr" V snr™ +i / 



e (tM" ■ n ^<4(snr)<^ v . y, (^-T ■ n ^ ^ 



where 

kL„ . V ( 

. At J A b, ™' \ Ay . / A r, 

and k' LB / and k' UB ( are given by OTT l and (l32l i. respectively. This shows that degenerate scenarios affect the 
constrained capacity infinite-snr value as well as the rate at which the constrained capacity tends to the infinite-snr 
value. 

In general, and the effect of degenerate conditions apart, the bounds in ( T89b suggest that the constrained capacity 
of a Rayleigh fading channel with transmit and receive antenna correlation is lower than the constrained capacity 
of a Rayleigh fading channel without antenna correlation for a certain signal-to-noise ratio in the regime of high 
snr. This is due to the fact that the value of the bounds in the correlated scenario in (|89l l is higher than the value 
of the bounds in the uncorrelated scenario in (|82| |. For example, in the presence of receive antenna correlation, it 
is possible to show that 

T t— > 1 (103) 

fc=l 

due to the fact that the function YYk=i I s Schur-convex and the vector [Arj Xr 2 ■ ■ ■ XR nr ] majorizes the 
vector [11 ■ ■ ■ 1] 11351 . In the presence of transmit antenna correlation, it is possible to show that the convex 
function 

MM /l\ n '' M M / ^ 



hh^J = ^ U((x,-x j) »e T ,x,-x j) ) J 

is minimized by @t = I in the set of unit-diagonal positive semi-definite matrices ©t- This result assumes that 
the constellations exhibit certain common symmetries (see also Section PvTb- Then, 

EE ±) =EE 



, ^TuJ , i \ tr (( x i -Xj-)t0 T (xi -Xj)) 



M M 




This is portrayed in Figure \TU\ 



a Sg U((^,-x J )t(„-x J .)) J -EE 1st I ao 5 ) 
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Fig. 10. Comparison of constrained capacity for a 2 X 2 antenna-uncorrelated Rayleigh fading coherent channel with 16-QAM inputs and a 
2x2 antenna-correlated Rayleigh fading coherent channel with 16-QAM inputs (<&t = [r/s °i 5 ] am ^ *R = [r/s °i 8 ])- 



B. Ricean Fading Channels 

We also write = (x, — Xj)(xj — ^j)- We denote the eigenvalue decomposition of the positive semi- 
definite matrix Ajj by Ajj = WyAyWy, where Wjj is a unitary matrix and Ay = diag(A,j, 0, • • • , 0) is a 
diagonal matrix with a single non-zero diagonal element Ajj = tr (Ay) = tr (Ay) = tr ((x; — Xj)(xj — Xj)TJ = 
ll(x 4 -x,)|| 2 =4 

The following Theorem, which also represents a generalization of Theorem [3] defines the high-snr behavior of 
the constrained capacity of Ricean fading coherent channels. 

Theorem 5: In the regime of high-snr, the constrained capacity of the n r x n t Ricean fading coherent channel 
with arbitrary equiprobable discrete inputs obeys: 

1 „ ( 1 



l(snr) = log M - e' n (snr) 



O 



r n r +l 



(106) 
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where 

M M 



k 'u Bnr EE ■ < *' tr(HoW nn ' w '"» ] 

i=i j=i \ a u ir+i / 

< 4 r (snr) < 

MM/ \ 

fci flnr -EE -2 _l_ • e- frtr ( H » w ^' e !< H ^) (107) 

j=l 3 '=1 V^ij ' / 



J7« 

and fc^ s and k' UB are given by ( f3TT > and d32| l, respectively. 



Proof: The squared pairwise Euclidean distance dj- between two arbitrary (noiseless) receive vectors is given 



by: 



It is also possible to show (see (3|) that the distribution of 



K 1 \ .„ . „.+ / / X 



is equal to the distribution of 



= E A ^i 2 (110) 

k=l 

where £k,k = 1, ...,n r , are independent circularly symmetric complex Gaussian random variables with mean 



j^j- ■ e^Wjj-HoWy ei and variance -^pj, respectively. This is a non-central chi-square (or gamma) distribution 
with 2n r degrees of freedom with probability density function given by 11311 : 



^H-^-U -e^-^-iM-^], d 2 >0 (111) 



where s 2 = \ %0 ■ jfc ■ f ££i etW^HoWyei J A,, • ^ • tr ^HoWtfeielWTHjJ and /„(•) is the 
nth-order modified Bessel function of the first-kind given by: 

W = E fc! r (n + fc + i) -(2) - ^° (112) 

The high-snr characterization of the constrained capacity also follows from Theorem I2 by noting that 

p^ } (0) = 0, fe = 0,...,n r -2 (113) 
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and 



Pd""~ 1} (°) = 7 r^w • e i;7 ^ ^ (H4) 



X . 1 

A y a+i 



It is also simple to show that the conditions for the application of Theorem [2] are satisfied, i.e., the function: 



and its higher-order derivatives are continuous and integrable on [0, oo). ■ 

In general, the bounds in ( 1107b suggest that the constrained capacity of a Ricean fading channel is normally 
higher than the constrained capacity of the canonical i.i.d. Rayleigh fading channel for a certain signal-to-noise 
ratio in the regime of high snr. This is due to the fact that the function 

M M 



jr ( _ 2 1 i j . e --ff-tr(H W„e 1 e+w| 3 .H+ ) = 

, -K-tr(a R a T " l "W 1J e 1 etw+ 3 a T a R t ) 



i=l j=l 
M M 





i 




' A'+l 




1 




1 




' A'+l 




1 




1 




' A+l 



i=l j=l 

M M 

( - - 1 1 ) ■ e -«r-if.tr(aT t Wy e ieIwt j a T ) ( U6) 

is monotonically decreasing in the A'-factor K provided that tr ^aT^WyeielwJ^aT^ > 1, V ! ^ j. This is 
portrayed in Figure [TT] 

However, as shown in Figure [12] and in contrast to the capacity of a multiple-antenna Ricean fading channel lfl2l . 
the constrained capacity of a Ricean fading channel can also be lower than the constrained capacity of a Rayleigh 
fading channel for a certain signal-to-noise ratio in the regime of high snr. This aspect is illustrated further by the 
following toy-examples. 

Example 1: Consider a 2 x 2 Ricean fading channel with two equiprobable discrete inputs xi = [1 0] T and 
x 2 = [0 1] T , where H = j^j ■ H + \/ aTT ' H w and H o = a R a T ^ with a R = [1 1] T and a T = [1 1] T . Then, 

k' UB2 ■ \ ■ (K + l) 2 < e' 2 (snr) < k' LB2 -\-{K+ if (117) 

with k' LB2 and k' UB2 given by OTb and d32l . respectively. 

Example 2: Consider a 2 x 2 Rayleigh fading channel with the two equiprobable discrete inputs xi = [1 0] T 
and x-2 = [0 1] T , where H = H w . Then, 

k'uB 2 ■ \< 4 (^r) < k' LBi ■ \ (118) 

with k' LB2 and k' UB given by OTb and d32l , respectively. 

Examples [T] and [2] suggest that, due to the bounds in ( II 17b and ( II 18b , this Ricean fading channel needs a higher 
signal-to-noise ratio than the Rayleigh fading channel for K > 0, in order to achieve a certain target constrained 
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Fig. 11, Comparison of constrained capacity for a 1 X 1 Rayleigh fading coherent channel with a 16-QAM input and a 1 X 1 Ricean fading 
coherent channel with a 16-QAM input (K = 2, Ho = 1). 

capacity in the regime of high snr. Figure [13] which shows only the simulated constrained capacity, confirms this 
behavior. This is due to the fact that one cannot rely on the deterministic component but rather, in comparison to a 
canonical i.i.d. Rayleigh fading channel, on the lower power random component in order to distinguish the transmit 
vectors. Note that the transmit vector differences Xi — x 2 = [1 — 1] T and x 2 — x x = [—1 1] T are orthogonal to 
the transmit array response ax = [1 1] T - 

VI. Designs 

The focus now is on the design of schemes for communication over multiple-antenna fading coherent channels 
driven by equiprobable discrete inputs, in the regime of high snr. In particular, we conceive designs for: i) optimal 
power allocation over a bank of parallel independent fading coherent channels; ii) optimal precoding for multiple- 
antenna fading coherent channels; and Hi) space-time coding for the multiple-antenna canonical i.i.d. Rayleigh 
fading coherent channel. The design principle is based on the optimization of the lower bound to the asymptotic 
expansion of the constrained capacity, rather than the exact asymptotic expansion of the constrained capacity. It is 
interesting to note though that the procedure leads to very sharp designs. It is also interesting to note that some 
of the design criteria coincide with standard design criteria in the literature, most notably the space-time coding 
criteria, which have been derived based on other principles. 
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Fig. 12. Comparison of constrained capacity for a 2 X 2 Rayleigh fading coherent channel with 16-QAM inputs and a 2 X 2 Ricean fading 
coherent channel with 16-QAM inputs (K = 3,H = [ \ }]). 




Fig. 13. Comparison of constrained capacity for the fading coherent channels in Examples \T\ and [2] (K = 2). 
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A. Power Allocation in a Bank of Parallel Independent Fading Channels 

We consider a bank of K parallel independent fading channels which can be modeled as follows: 

y k = \/in7- h k ^/pT x k + n k , k = l,...,K (119) 

for a single channel use, where y k £ C is the fcth sub-channel complex receive symbol, x k G C is the fcth sub- 
channel complex transmit symbol, h k £ C is the fcth sub-channel random complex fading coefficient and n k € C 
is a circularly symmetric complex Gaussian noise random variable with zero-mean and unit- variance. The variable 
Pk € Kg corresponds to the power injected into the fcth sub-channel. The scaling factor snr relates to the signal-to- 
noise ratio. We assume that the transmit symbols x k ,k = 1, . . . , K, conform to equiprobable unit-power discrete 
constellations with cardinality M^,k = 1, . . . , K. We also assume that all the random variables are independent. 
The constrained capacity of the bank of parallel independent fading channels is given by: 

K K 

T(snr) = ^T fc (snr) = {\{x k ; ^/snr ■ h k y/pk~ x k + n k \h k )} (120) 

k=l k=l 

The objective is to determine the power allocation procedure that maximizes a lower bound to the constrained 
capacity in the asymptotic regime of high snr, subject to the total power constraint: 

A" 

]T> fe <P (121) 

fc=i 

with p k > 0, k = 1, . . . ,K. 

Next, we consider a bank of parallel independent channels subject to either Rayleigh fading or Ricean fading. We 
denote the squared pairwise Euclidean distance between two distinct transmit symbols in a particular sub-channel 

fc, x ki and x k] , by: 

dk u = IK -zfc,f (122) 

We also denote the squared pairwise Euclidean distance between two distinct noiseless receive symbols in a particular 
sub-channel fc, h ky /pk~x ki and hky/PkX kj , by: 

d kij = \\h k y/pk (x ki ~ x k] )\\ 2 =p k \\h k \\ 2 \\xki ^x kj \\ 2 =p k \h k \ 2 dl. (123) 

1) Rayleigh fading case: In the Rayleigh fading case, the complex fading coefficients h k ~ CAf{0, er^ ) , fc = 
1, . . . , K. Therefore, the probability density function of the squared pairwise Euclidean distance between two distinct 
noiseless receive symbols in a particular sub-channel fc is given by: 



The asymptotic characterization of the constrained capacity, which can be found by using Theorem [2] together 
with the properties of ( 1124b , is given by: 

T(snr) = log M - e[ (snr) ■ — + O ( At ) (125) 

snr Vsnr 2 / 
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where M = J]f =i M k and 



J\ IVI k IVI k J\ |Vl k IVI k 

E fc u Bll E E s4— ^ ^ ^ * E *k E E :r4r- 



(126) 



and 



J L 4 « r (V 2 ) 

2M k (M k -l) T(3) 



fc'uB, = 1 , -^=-4"-^ (127) 



*k, =lf •^=•4"-^ (128) 



J_ 4 « r(5/2) 

M k r(3) 

As snr — > oo, it follows from ( 1125b and the upper bound to (snr) in (1126b that the power allocation procedure 
that maximizes the lower bound to the constrained capacity is given by: 

p* k = a ■ — = + o (1) (129) 

/ 1 <T-M k V^ M k H2 ,y2 
M k 2-i=l 2-.?=l u kij CT h fc 



A" 



1 1 1 

a~ 1 = --> ; (130) 

P f-J /T v M k v M k n 2 2 

fe=l 4 / M k 2-1=1 2-1=1 a k ii (T / lfc 



with 



M k i-n=\ £-13 = 

Note that, given equal sub-channel constellations, the higher the average sub-channel strength (i.e., the higher 
u1 k ), then the lower the allocated power. Note also that the power allocation policy embodied in ( 1129b and ( 1130b 
in fact corresponds to the power allocation policy put forth in 11241 . 

Figures [14] and Q3] compare analysis to simulation for a bank of two parallel independent Rayleigh fading 
coherent channels driven by 16-QAM inputs. We observe that the optimal power allocation policy, obtained via 
direct optimization of the Monte-Carlo simulated constrained capacity, tends with the increase of the signal-to-noise 
ratio to the high-snr power allocation policy embodied in dl29b and ( 1130b . We also observe that the constrained 
capacity associated with the optimal power allocation policy tends with the increase of the signal-to-noise ratio to 
the constrained capacity associated with the high-snr power allocation policy. It is important to note though that 
the designs, which are shown to be very good in the regime of high snr in Figures [T4l and [TBI are not sharp for 
parallel independent channels driven by distinct inputs. This is due to the fact that one can only bound rather than 
compute the exact value of the quantity akin to the MMSE dimension, which differs for different inputs. 

2) Ricean fading case: In the Ricean fading case, the complex fading coefficients hk ~ CJ^[fj,h k ,cr^ lk ),k = 
1, . . . , K. Therefore, the probability density function of the squared pairwise Euclidean distance between two distinct 
noiseless receive symbols in a particular sub-channel k is given by: 



Pdl (d 2 kiJ ) = -j-^— • e W ■ I , d 2 kij > (131) 



where s 2 k = d 2 k ■ Pk ■ {^ hk } 2 + %{Vh k } 2 ) and 



W = E o W(¥TT)-(2) < ^° (132) 
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Fig. 14. Optimal power allocation in a bank of two parallel independent Rayleigh fading coherent channels driven by 16-QAM inputs (cr^ = 4 
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Fig. 15. Average mutual information in a bank of two parallel independent Rayleigh fading coherent channels driven by 16-QAM inputs 
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Fig. 16. Optimal power allocation in a bank of two parallel independent Ricean fading coherent channels driven by 16-QAM inputs (pin = 1+3 > 
Vh 2 =1+3, a 2 hl =4and<T ; 2 2 = 1). 
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Fig. 17. Average mutual information in a bank of two parallel independent Ricean fading coherent channels driven by 16-QAM inputs (p-hi 
1 + 3, Hh 2 =l+ 3, <>t = 4 and a\ = 1). 
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The asymptotic characterization of the constrained capacity, which can also be found by using Theorem [2] together 
with the properties of (1131b . is given by: 



l(snr) = logM-e' 1 (snr)- — + O ( ) (133) 

snr V snr 2 1 



where M = J]fcLi M k and 



K M k M k s {»h k Y + *{»h k Y 

2 

CT I 

e h t 



fe=l i=l 

< A (snr) < 



K M k M k J ,R {^ k } + *{»h k } 

fe=l i=l 3=1 a krj%P fc 



and 



fc' 1 1 d n r (5/2) 

UBl * 2M k (M k -l) T(3) 



(135) 



^ = ^ ■ 4= • 4" • ^ (136) 



JL J_ 4 « r ( 5 / 2 ) 
M k 0? r(3) 

As snr — > oo, it follows from ( 1133b and the upper bound to e[ (snr) in (11341 i that the power allocation procedure 
that maximizes the lower bound to the constrained capacity is given by: 

pj = a-—^= " =+o(l) (137) 

j_ Mk v M k Ti 2 

with 

B KJ +a KJ 

1 K ^1 

q~ =H-y] i ( 138 ) 

P f-f /j_ V M k v M k T 2 2 

fc=i W ^2^=12^=1%°)^ 



Note now that the Ricean K-factor ^5R{/i/ ifc } + 3{/i/ lfc } J j af lk has a direct impact on the power allocation 
procedure (compare (11291 l and ( 1130b to (11371 ) and (11381 )). 

Figures [16] and [T7] which compare analysis to simulation for a bank of two parallel independent Ricean fading 
coherent channels driven by 16-QAM inputs, also demonstrate that the optimal designs tend with the increase of 
the signal-to-noise ratio to the high-snr designs. @ It is also important to note that the designs, which are shown to 
be very good in the regime of high snr in Figures Q~6] and [TTJ are also not sharp for parallel independent channels 
driven by distinct inputs, as discussed previously. 

8 Note that a more substantial difference between the constrained capacity associated with the optimal power allocation and the constrained 
capacity associated with the high-snr power allocation is expected for a bank of parallel independent fading coherent channels with more than 
two sub-channels. The simulations, however, would be very time consuming. 
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B. Precoding in a Multiple-Antenna Fading Channel 

We consider a linearly-precoded multiple-antenna fading channel which can be modeled as follows: 

y = \/snr'HPx + n (139) 

for a single channel use, where y e C" r is the vector of complex receive symbols, x G C" f is the vector of 
complex transmit symbols, H € C n, ' x "' is the random channel fading matrix (with E{tr(HH^)} = n t n r ), 
and n € C" r is a vector of independent circularly symmetric complex Gaussian noise random variables with 
zero-mean and unit- variance. The matrix P € c™* x ™* represents a linear precoder^ The scaling factor snr relates 
to the signal-to-noise ratio. We assume that the transmit vector conforms to an equiprobable multi-dimensional 
constellation with cardinality M, i.e., x g {xi, X2, . . . , xm} and Pr(xi) = Pr(x2) = ■•• = Pi'(xm) = ^, with 
S x = E jxx'} = i • I. We also assume that all the random variables are independent. 

The objective is to determine the precoder that maximizes a lower bound to the constrained capacity in the 
asymptotic regime of high snr, subject to the total power constraint: 

tr(PP f ) < P (140) 

We illustrate the design procedure for the canonical i.i.d. Rayleigh fading coherent channel and the antenna- 
correlated Rayleigh fading coherent channel. In particular, the availability of closed-form expressions for the bounds 
to the asymptotic expansions of the constrained capacity, which embody the effect of the precoder, leads to simple 
design methods based on numerical procedures - and occasionally analytic results - rather than time-consuming 
Monte Carlo simulation procedures. 

1) The Canonical i.i.d. Rayleigh Fading Channel: In this channel model, where H — H w , the squared pairwise 
Euclidean distance between two arbitrary noiseless receive vectors is given by: 

djj = tr (H W P ( Xi - Xj ) ( Xi - Xj -) f PtHU (141) 



u 

and its probability density function is given by: 



K- 1 ) 1 ftrf(x l -x,) t PtP(x. i -x 



■e K(**-*i) tptp (*'-*;)) , d?>0 (142) 

This, together with Theorem[2] leads to the high-snr characterization of the constrained capacity, which is a function 
of the precoder, given by: 

T(snr) = log M - e' n (snr) ■ + O ( (143) 

y ' b n r \ i snr „ r ^snr"^ 1 / 

9 We consider without any loss of generality the matrix P to be square. 
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where 



M M 



k UB nr ■ E E 



1 



tj=i \tr((x. 1 -x,) t PtP(x J -x J ) 



< 4 r (snr) < 

M M 



fc U EE -? r v (144) 

^ J ttltrf(x l -x J )tpt P (x J -x J ) X 1 



and 



k' 1 J_ 4 n r ("r + 3/2) 

* UB - _ 2M (M - 1) rK + 2) (145) 



fc^ = -±.^.4"- i^-^ (146) 



1 1 r r K + 3/ 2 ) 

M 0f r (n r + 2) 

Therefore, we pose the optimization problem, which is equivalent to the maximization of the lower bound (or the 
upper bound) to the constrained capacity of the canonical i.i.d. Rayleigh fading coherent channel in the asymptotic 
regime of high snr subject to a total power constraint, given by: 

M M 



min V V — , r- (147) 

p S J ttltr((x i -x i ) t PtP(x i -x J ))y 

with: 

tr (PP 1 ) < P (148) 

This optimization problem leads immediately to a simple precoder design procedure, that bypasses the need for 
time-consuming Monte Carlo simulations, based on numerical or analytical techniques. We illustrate the analysis 
by assuming that the individual precoder inputs conform to the same equiprobable symmetric constellation, e.g., 
some PSK or some QAM constellation. 

By introducing the change of variables Z = P^P, we pose an optimization problem equivalent to the optimization 
problem in ( 11471 ) and ( 11481 as follows: 

M 



E(— ^ -) w 

subject to: 



mm 

z i, j= i \tr ( (xj - Xj) 1 Z (x, - Xj 



tr(Z)<P (150) 
Z^O (151) 



10 Note that this analysis only seems to be applicable to scenarios where the individual precoder inputs conform to the same equiprobable 
symmetric constellation, rather than scenarios where the individual precoder inputs conform to distinct constellations. 
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Note that this represents a convex optimization problem because: ;) the objective function is convex in Z y 
(Z 7^ 0); ii) the constraint set is convex. Define the Lagrangian of the optimization problem as follows: 

C{Z,*,\)=Y,[— f " IV ) -A-(P-tr(Z))-tr(*Z) (152) 

i,i=i \ tr ( ( Xl - x i) Z ( x * _ x j) ) / 

The Karush-Kuhn-Tucker conditions, which are both necessary and sufficient, state that the optimal solution Z* is 
such that: 

Vz£(Z,*,A)|z=z. = 

/ \ n r +l 

M 



-n r ■ 



E -7 A ' ( x * - x j) ( x * - **) f + A • I - * = (153) 



\ tr ( (xj - Xj-) Z* (xj - xj) 1 



tr(*Z*)=0, * ^ 0, Z* h (154) 

A • (P - tr(Z*)) = 0, A>0 (155) 
It is possible to show that Z* = • I without any loss of generality, because Z* = • I together with 

(\ n r +l 
— — J\ •|x i (A;)-x J (fc)| 2 (156) 

which does not depend on k due to the symmetry conditions, and * = satisfy the Karush-Kuhn-Tucker conditions. 
Note that x(fc) represents the kth element of the vector x. It is only necessary to prove that: 

(\ n r +l 
—, ^ -A ■(x i -x J )(x i -x,) t = A-I (157) 
tr((x l -x J )(x;-x J ) T J 

Note that the element in the m-th row and n-th column of the matrix on the left hand side of ( 11571 ) is given by: 



E \ —( A ) •(x i (m)-x j (m))(x i (n)-x J (n)) t (158) 

1 V tr (xj - Xj) (xj - Xj) 1 



In view of the symmetry conditions, for fixed m and n it is possible to divide the set of pairs of indices 
: i,j = 1, • • • , M,i 7^ j}, which contains M • (M — 1) indices pairs, into i- M-(M — 1) sets of pairs of indices 
{(ii)ii)) («2,j2)}i which contain only two indices pairs, with the property that x.^m) = — x, 2 (m), x 3 - 1 (m) = 
— x j2 (m), Xjj (fc) = Xj 2 (fc), V fc 7^ m, and x 3l (fc) = x j2 (fc), V fc 7^ to. Then, it follows immediately that: 

)n r + l 
• (x n (m) - Xj t (to)) (x 4l (n) - x^ (n)) f + 

it I ' (^2 (m) - x ja (to)) (x 42 (n) - x ja = 

tr (Jx ?2 -x i2 )(xi 2 -x j2 ) T 
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and that 

n r + l 




' •(x l (m)-x J (m))(x J (n)-x J (n)) t = (160) 



tr ( (xj - Xj) (xj - Xj) 



Note also that the n-the diagonal element of the matrix on the left hand side of ( 1157b is given by: 

n r + l 



M M 



EE ( — 7: ~ ) •(x i (n)-x i (n))(x i (n)-x J (n)) t (161) 



i=l3=X V tr (( Xi " X j)( Xi_X i 

In view of the symmetry conditions, it also follows immediately that this quantity is independent of n. 

The fact that Z* = ^- ■ I without any loss of generality leads directly to the form of the precoder that maximizes 
the lower bound to the constrained capacity as follows: 

P* = — QI (162) 

n t 

where Q is any unitary matrix. This result is very intuitive due to the symmetry of the system model. 

Figure [18] shows that the constrained capacity achieved by the optimal precoder, obtained via direct optimization 
of the Monte-Carlo simulated constrained capacity, appears to be equal to the constrained capacity achieved by the 
high-snr isotropic precoder across the signal-to-noise ratio range. This suggests that the approach is sharp in the 
sense that it captures well the effect of the precoder on the constrained capacity of the canonical i.i.d. Rayleigh 
fading coherent channel. 

i i 

2) The Antenna-Correlated Rayleigh Fading Channel: In this channel model, where H = 0^H W ©^,, the 
squared pairwise Euclidean distance between two arbitrary noiseless receive vectors is given by: 

djj = tr ^©|h w 0|P (Xi - Xj) ( Xi - x,) 1 pt©|,Ht©4j ( 163 ) 
and its probability density function is given by: 

Pd?(djj) = 7 777 ' -1—, 7 " ' ( d ij) 

J (tr((x 4 -x,) t Pt© T P(x. 1 -x,)" X 

if ,\((, i -, j )tpt; T P(, i -, j )) i d 2> Q (164) 

when the eigenvalues of the receive correlation matrix, A ^ , Xr 2 , . . . , , are all equal (necessarily to one) or 



^(4> = E 



fi ^ UlU l 1 - • tr ((* - xj? pt0 TP (xi - x 



■e ^^^-'il'^^^-i)), d?>0 (165) 
when the eigenvalues of the receive correlation matrix, Xr ± , Xr 2 , ■ ■ ■ , X R nr , are all distinct 

"We once again concentrate on non-degenerate scenarios where Aj^. > 0,V i ^ j, and Ajj fc > 0, V k, and on scenarios where the 
eigenvalues Afj fc , k = 1, . . . , n r , are either all distinct or all equal (necessarily to one). 
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Fig. 18. Average mutual information for a 2 X 2 canonical i.i.d. Rayleigh fading coherent channel driven by QPSK inputs. 



This, together with Theorem [2] also leads to the high-snr characterization of the constrained capacity, which is 
a function of the precoder, given by: 



where 



and 



l(snr) = log M - e' n (snr) 



M M 

k uB nr ■ ^2 

»=1 3=1 

< 4„ (snr) < 

M M 

i=l 3=1 



UB„ 



1 



O 



1 



tr ( (xi - x,) 1 Pt0 T P (x< - Xj -) ) / det (® 



tr ((x, - Xi ) f Pt© T P (m - x,)) / det 



1 4 „ r(n r + 3/2) 



2M(M-1) T(n r + 2) 



2 1 4 „ TK + 3/2) 



(166) 



(167) 



(168) 



(169) 



" LB - M ' y^r T (n r + 2) 

Therefore, we pose the optimization problem, which is equivalent to the maximization of the lower bound (or 
the upper bound) to the constrained capacity of the antenna-correlated Rayleigh fading coherent channel in the 
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asymptotic regime of high snr subject to a total power constraint, given by: 

mm/ 1 \ 



I 




tiU l v tr((x l -x,) t Pt0 T P(x l -x,)) 



det (0 R ) 



(170) 



with: 



tr (PPt) < P 



(I7l) 



This optimization problem, akin to the previous one, also leads immediately to a simple precoder design procedure 
based on numerical techniques. Most notably, it is possible to prove from fll70t and dl7l| ), using the techniques in 
l35l . that the matrix containing the left singular vectors of the precoder matrix corresponds to the matrix containing 
the right singular vectors of the transmit-antennas correlation matrix, i.e., the optimal precoder diagonalizes the 
transmit-antenna correlation matrix. It is also possible to prove from ( II 70! ) and ( Il7 lb that, upon setting the matrix 
containing the left singular vectors of the precoder matrix to be equal to the matrix containing the right singular 
vectors of the transmit-antenna correlation matrix, the optimization problem becomes concave in the precoder 
squared singular values. These facts have also be been recently established by different means in l36l . 

Figure [19] shows that the constrained capacity achieved by the optimal precoder design, obtained via direct 
optimization of the Monte-Carlo simulated constrained capacity, tends with the increase of the signal-to-noise ratio 
to the constrained capacity achieved by the high-snr precoder design, obtained via the optimization problem in 
( II 70! ) and ( I171I ). This suggests once again that the design approach is sharp in the sense that it also captures well 
the effect of the precoder on the constrained capacity of the antenna-correlated Rayleigh fading coherent channel. 

C. Space-Time Coding in the Multiple -Antenna Canonical i.i.d. Rayleigh Fading Channel 

We now consider a more general multiple-antenna fading channel, which encompasses communication over t 
symbol intervals, given by: 



where Y <G C" rX ' denotes the receive codeword matrix, X <G C' ltXt denotes the transmit codeword matrix, 
H w € C" rX ™ t is the canonical i.i.d. complex Gaussian random channel fading matrix, and N € C'" rXi is the 
noise matrix consisting of independent circularly symmetric complex Gaussian random variables with zero-mean 
and unit- variance. We assume that the channel matrix is constant for the duration of a codeword, changing from 
codeword to codeword in a stationary and ergodic manner. We also assume that the receiver knows the exact channel 
matrix realization but the transmitter knows only the channel matrix distribution. The space-time codeword matrices 
conform to an equiprobable multi-dimensional constellation with cardinality M, i.e., X e {X l5 X2, . . . ,Xm} and 



By capitalizing on Theorem [2] the high-snr expansion of the constrained capacity in nats per channel use, where 
a channel use encompasses t symbol intervals, obeys: 




(172) 



Pr(X 1 )=Pr(X 2 ) 



Pr (X M ) - ±. 




(173) 
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Fig. 19. Average mutual information for a 2 X 2 antenna-correlated Rayleigh fading coherent channel driven by QPSK inputs 
(* T = [ l 5 i5] and# R =[ l 8 1 8] ) . 



where 



and 



M M 

d = 1 + min <{ n G No : ]T J^PtfH ) 

i=l j=l " 



M M 
\ - \ - ,(d-X) 



M M 
\ - \ - (d-1). 



fc ue d • U ^>5 - L '(0) < 4 (snr) < fc^ Bd • ^ "(0) 



i=l 3 = 1 



i=l j=l 



2_ r(d + 3/2) 

" LBd ~ M ' ' r (d + 2) 



1 



r(d + 3/2) 



UB d 



(174) 



(175) 



(176) 



(177) 



2M(M-1) T(d + 2) 

and p&(-) and p2\-) denote the probability density function and the higher-order derivatives of the probability 

■i ij 

density function of the squared Euclidean distance between pairs of arbitrary noiseless receive codeword matrices. 

It is evident that the exact value of d and the bounds to the value of e' d (snr) are connected to the properties of 
the set of space-time codewords. The objective is to unveil the structure of the set of space-time codewords that 
lead to: i) the maximization of the exact value of d; and ii) the minimization of the upper bound (as well as the 
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lower bound) to the value of e' d (snr). This leads to the maximization of the lower bound to the constrained capacity 
in the asymptotic regime of high-snr regime. 

The following Theorem summarizes the space-time code design criteria for the canonical i.i.d. multiple-antenna 
Rayleigh fading coherent channel, with two transmit antennas and one or two receive antennas. We denote Ay = 
(Xj — Xj)(Xj — Xj)^. We also denote the eigenvalue decomposition of the matrix Ay by Ay = WAjjW', 
where W is a unitary matrix, Ay = diag(Ay (1) , . . . , Ay (ry) , 0, . . . , 0) is a diagonal matrix, Ay (1) > ■ ■ • > 
Ay (ry) > are the non-zero eigenvalues of Ay, and ry = rank (Ay) < mm (n t ,t) is the rank of Ay. We let 
Rij < fij represent the number of distinct eigenvalues in Ay (1) , . . . , Ay (ry). We also let Ay- (1) , . . . , Ay (-Ry) 
and r-^-(l),..., ry (i?y) represent the distinct eigenvalues and their multiplicities, respectively, where ry (1) + 
■••+ry (iZy) = ry. 

Theorem 6: Consider the canonical i.i.d. multiple-antenna Rayleigh fading coherent channel model in (1172b with 
nt = 2 and n r = 1 or n r = 2, where X G {Xi, X 2 , . . . , X M } and Pr (X = Xi) = Pr (X = X 2 ) = ■ ■ • = 
Pr(X = X M ) = fa. Then, 

• The value of d is maximized by the set of space-time codewords that maximize: 

r m in = minry (178) 

• The value of the bounds to e' d (snr) are maximized by the set of space-time codewords that maximize: 

£ nf^r (179) 

{i y }en r=i \A*Ar)/ 

where ft = G {1, . . . , M} x {1, . . . , M} : ry = r mi „,»^ j}. 

Proof: Let us consider the squared Euclidean distance between a pair of arbitrary (noiseless) receive codeword 
matrices associated with the pair of transmit codeword matrices Xj and Xj given by: 



dfj = tr^H w AyH w T J (180) 
It is straightforward to show that the distribution of 

tr(H w AyHO =tr(H w WAyWtHO (181) 



is equal to the distribution of 

tr(H w AyHt) = E Ay (n)|6n„| 2 (182) 

m— 1 n—1 

where £ m „ are independent circularly symmetric complex Gaussian random variables with zero-mean and unit- 
variance. Its probability density function is given by |37l , (38): 

( 2 , _ 1 ffi"'^ r) ^ (^(l),...,^^)) 

""' 1 ' n^A^ (r) (r)"^ * KW-r')! 

.(_ d 2 ) n^M-/. e -^ T) ^ .„ ,183, 
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where 



* P , r , K(l), . . . , n r r[ 3 {R t3 )) = (-1)^ W" 1 £ g ( 



Hi,, 

fci,...,feif. . efi „, fe=i 



1 1 



K 3 (k) Kj(r) 



(184) 



with 



n ry = {k u k Rij e : fej H h fciiy = / - 1, fc r = 0, fc fc > V A:} (185) 

Let us consider the scenario where n t = 2 and n r = 1. It is possible to show from ( 1183b . ( 11841 ) and ( 11851 ) that 
if Ajj(l) > and Ay (2) > 0, which implies that = 2, then 

Pd2 (0) = (186) 
ij 

and 

Pd '> ^ ^ Ay (1) Ay (2) ^ 

whereas if Ay(l) > and Ay (2) = 0, which implies that ry = 1, then 

^(o) = y4tt^° (188) 

Let us now consider the scenario where nt = 2 and ?i r = 2. It is also possible to show from ( 11831 ). ( 11841 and 
(fT85T > that if Ay(l) > and Ay (2) > then 

p$\o)=0, n = 0,1,2 (189) 

ij 

and 

where as if A^^ (1) > and Ay (2) = then 

Pd?(0) = (191) 

and 

Therefore, the Theorem follows immediately from the high-snr characterization in (11731 ). (1174b . ( 1175b . d 1 76b and 
(fT77b . 

■ 

In principle, it is also possible to generalize the result for canonical i.i.d. multiple-antenna Rayleigh fading 
coherent channels with an arbitrary number of transmit and receive antennas. However, this requires considerable 
algebraic manipulation due to the form of the probability density function in ( |183b , dl84b and d 1 85b . 
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Note that if n t > t, then the maximum possible value for is t; in contrast, if n t < t, then the maximum 
possible value of r^ is n t . Note also that if r,j = nt, V i ^ j, then the maximization of: 



corresponds to the maximization of: 



{ij}en r=i 



M M 



We conclude that the space-time code design criteria embodied in Theorem [6] are akin to the conventional rank 
and determinant design criteria in ll39l , which have been derived from the pairwise error probability view point. 
Since the average error probability, the average MMSE and the average mutual information exhibit identical high- 
snr behavior, the design criteria put forth in Theorem [6] in addition to maximizing the lower bound to the average 
mutual information, also minimize the upper bounds to the average MMSE and the average error probability. 

VII. Conclusions 

By drawing on the I-MMSE identity and counterparts, together with key results in asymptotic analysis and 
expansions, it has been possible to put forth a high-snr characterization of the constrained capacity of multiple- 
antenna fading coherent channels driven by arbitrary equiprobable discrete inputs. This characterization has enabled 
the study of the effect of various system models, parameters, and elements on the constrained capacity of multiple- 
antenna fading coherent channels. Key contributions include: 

• In the antenna-uncorrelated Rayleigh fading coherent channel, we have analyzed the impact on the high-snr 
behavior of the constrained capacity of the number of transmit antennas, the number of receive antennas and 
the characteristics of the multi-dimensional constellation. It has been observed that in the regime of high-snr the 
constrained capacity increases with the increase in the number of transmit antennas and the number of receive 
antennas, as well as with the use of multi-dimensional constellations with better sphere-packing properties. In 
particular, it has been observed that in the regime of high-snr the number of receive antennas control the rate 
at which the constrained capacity value tends to its infinite-snr value whereas the number of transmit antennas 
control the constrained capacity power offset, by enabling the construction of multi-dimensional constellations 
with better sphere-packing properties. It has also been observed that the geometry of the multi-dimensional 
constellation, most notably, its sphere-packing properties, also control the constrained capacity power offset. 
It has also been emphasized that in the regime of high-snr the role of the transmit and receive antennas on the 
behavior of the constrained capacity of canonical i.i.d. Rayleigh fading channels is very different from their 
role on the behavior of the capacity of the canonical i.i.d. Rayleigh fading coherent channel, which is achieved 
by Gaussian inputs. 

• In the antenna-correlated Rayleigh fading channel, we have analyzed the impact on the high-snr behavior of 
the constrained capacity of the number of transmit antennas, the number of receive antennas, transmit and 
receive antenna correlation and the characteristics of the multi-dimensional constellation. It has been observed 
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that in the regime of high-snr the constrained capacity also increases with the increase in the number of transmit 
antennas and the number of receive antennas, as well as with the use of multi-dimensional constellations with 
better sphere-packing properties in the coordinate system induced by the transmit correlation matrix. It has been 
found that the presence of transmit and receive antenna correlation has a negative impact on the constrained 
capacity in the regime of high-snr. It has also been found that degenerate conditions, i.e., perfectly correlated 
paths, have an impact on the constrained capacity infinite-snr value as well as the rate at which the constrained 
capacity value tends to its infinite-snr value. 

• In Ricean fading channels line-of-sight components also have profound implications on the behavior of the 
constrained capacity in the regime of high snr. Of particular interest, it has been observed that, whereas in 
a single-transmit single-receive antenna Ricean fading coherent channel the constrained capacity increases 
with the increase in the K-factor in the regime of high-snr, in a multiple-transmit multiple-receive antenna 
Ricean fading channel the constrained capacity may decrease with the increase of the K-factor in the regime 
of high-snr. This result, which does not occur for Gaussian inputs, is due to the nature of the interaction of 
the multi-dimensional constellation with the channel model. 

• The high-snr characterization of the constrained capacity has also enabled the design of elements for multiple- 
antenna fading coherent channel models. We have considered power allocation in a bank of parallel independent 
fading coherent channels, precoding for multiple-antenna Rayleigh fading coherent channels and space-time 
coding for multiple-antenna Rayleigh fading coherent channels, showing that the expansions lead to very sharp 
designs. 

Of particular relevance, the construction of the high-snr constrained capacity characterization has also disclosed 
intimate connections between the high-snr asymptotic behavior of the average minimum mean-squared error, the 
average mutual information and the average error probability in multiple-antenna fading coherent channels driven 
by arbitrary equiprobable discrete inputs. These connections, which suggest that designs that minimize (bounds to) 
the error probability also minimize (bounds to) the minimum mean-squared error and maximize (bounds to) the 
average mutual information, provide a unification of the behavior of key performance metrics in the asymptotic 
regime of high snr. Overall, this high-snr analysis complements the low-snr insight available in J4], 0. 



We determine the lower bound to the MMSE by using a genie that supplies the receiver with the true input 
vector and any of the other input vectors with equal probability. The genie based estimate is given by: 



Appendix A 



Proof of LemmaQ] 



x, 



-genie 




}) 




(195) 
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where Xj is the true input vector and Xj is the other input vector. It follows that the MMSE can be lower bounded 
as follows: 

mmse(snr;H) = E |||Hx - HE{x|y}|| 2 |h| (196) 
1 M 

= M^ E { l|Hx_HE{x|y}!|2 ' X = Xl ' H } (19?) 



i=l 

M M 



M(M - 


1) 


1 




M(M - 


1) 


1 




M(M - 


1) 


1 




M(M - 


1) 



c 15 h} (198) 



A A HHxi-Hx,-!! 2 /||Hxi-Hx 7 -|| 2 snr\ 
E E 4 3 ' mmse B p S K " 3 " (199) 

AAdg(H) /d|j(H)snr\ 
T) E E ' mmse BPSK (^^4^ J ™ 



" "d?(H) i / d?(H)snr\ 

-^E^-^^ly 4— J = mmse -(snr;H) (201) 

The first inequality in d 1 98l > is due to the genie based estimator and the second inequality in ( 12011 ) is due to the 
lower bound to the MMSE of BPSK: 

r°° e -(«-vp) 2 /- 00 e -(n-vp) 2 r°° e -« 2 1 

mmse B PSK(p) = 1 - / tanh(2^£) j= d£ > 1 - / = d£ = 1 - / —=d£ = r ■ erfc(v^) 

(202) 

We determine the upper bound to the MMSE by using a (sub-optimal) Euclidean distance based estimator rather 
than the optimal conditional mean estimator. The Euclidean distance based estimate is given by: 

Xeuc(y) = argmin y - Vsnr • Hx , (203) 
x M ii 

It follows that the MMSE can be upper bounded as follows: 

mmse(snr;H) . | Hx 1 1 . { x y j J |h| (204) 
1 M 

.{ Hx HE{x|y} ' x x,.n} (205) 

i=l 
M 

< -E E {ll Hx - Hi - C (y)H 2 | x = x ^ H } (206) 

1=1 

M M 



= jjf E E E { 1 1 Hx - Hx ™ c (y ) 1 1 2 I x = x, , y e , H } • Pr { y e V, | x = x, , H } (207) 

t=l 3=1 

1 " " , 1 / /djj (H)snr\ . 

^ M E E d ^ H ) ' 2 ' erfc ( V 4 j 4 mmse ^ ( sn H H) (208) 
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where V; is the Voronoi region associated with Hx;. The first inequality in ( |206t is due to the (sub-optimal) 
Euclidean distance based estimator and the second inequality in ( 12081 ) is due to 



1 1 / d?(H)snr\ 
Pr {y e Vj |x = Xj, H} < - • erfc I y (209) 

and , { llx - Hx eMC (y)|| 2 |x = x„ye V,-,h} = d? (H). 

Appendix B 
Proof of Lemma[2] 

We determine the lower bound to the mutual information by using ( fT3T > in ( [ToT l as follows: 

/•OO 

I (snr; H) > log M - / mmse UB (£; H) (i£ (210) 

J snr 



= ** M - £ E E I «t (H) • r I y ^ J * cud 

^ M M /.oo -, d ? (H)5 

>kM--^^ d?(H).-.e-^-de (212) 
i=i j=i ^ snr 

1 d-(H)snr 

= 1 °S M -M^^ 2 ' r ^"' LB(snr;H) (213) 

i=i j=i 

The inequality in ( 1212b follows from the upper bound to the complementary error function erfc (a;) < e~ x2 . 
We determine the upper bound to the mutual information by using ( fT4b in ( [ToT l as follows: 



l(snr;H)<logM- / mmse LB (£; H) d£ (214) 

J snr 

^ logM-— -J -f V f SM . - ■ erfc ( * 



(215) 



3 = 



1 " m d 2 H snr 1 / d H S nr . 

l 0g M + — X^X^-!i±-L erfc \ \ ^— (216) 

B MM - 1) 4 2 IV 4 ] 

x 7 z=l j — 1 



1 ^ m d?(H)snr i 



djj(H)s„ 

e 



TyEEV^ 2'-7^ < 217) 



1(M - 1) 

4=1 j = l 

M M 



'd? (H) snr 



1 ^ m / d?(H)snr\ 

* log M - mTm^I) E E i ■ erfc V Jl V- - luB (snr; H) (219) 



l(M - . 

' »=i j=i 



The inequality in (|2191 l follows from the upper bound to the complementary error function erfc(x) < e x /Vn ■ x 2 . 
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Appendix C 
Proof of Lemma[3] 

The lower and upper bounds to the average value of the MMSE are obtained from the lower and upper bounds 
to the MMSE, respectively, by averaging over the fading statistics as follows: 

mmsei_B (snr) = Eh {mmsei_B (snr; H)} 



i=l 3=1 



iS?EE/ n 8M(M-l) -^ Snr - erfC \ 



d?snr \ / o \ o 

>- Pd 2(d3)dd2 (220) 



mmseuB ( snr ) = {mmseuB (snr; H)} 



1 v^v^ f 1 



i=l j=l 



-VV — -d^snr-e 



rfc 



' d?snr 



P<r, (djj)ddjj 



(221) 



We establish the asymptotic expansions as snr — > oo of the integrals that compose the lower bound and the 
upper bound to the average value of the MMSE in (12201 i and (122U . respectively, by using |29] Theorem 3.2]. This 
Theorem requires that: 



/°° f°° I to 1 

dt n -2 ■ J t ■ erfc ( y j dt 



< a„(t) • 0„(A), n = 1, 2, . . . , N + 2 (222) 



These conditions are satisfied with a„ (t) — n ■ 4" +1 and </>„ (A) = 1 because, by using the well known upper 
bound to the complementary error function erfc(x) < e~ x2 ,x > 0, it is possible to prove that: 



dt n -i 



to ■ erfc I \l ^ ) dto 



< (Xt + An) ■ 4" • e- xt/4 < n ■4 n+1 ,n = 1,2,..., N + 2 (223) 



This Theorem also requires that the functions (d?) , n = 0, 1, . . . , N + 1, are continuous on [0, oo), p^2 +2> (djj 



(N+2) /r , 2 \ 



(224) 



is piecewise continuous on [0, oo) and ^ 

poo 

/ a„(d?)#(d?)^<oc 

These conditions are satisfied because, by assumption, the functions pw (djj),n = 0,1,..., are continuous and 
integrable on [0,oo). 

Consequently, the asymptotic expansions as snr — > oo of the integrals that compose the lower bound and the 
upper bound to the average value of the MMSE are given by: 

n=0 



N 1 
Z_, snr n+l 



N+2 



(225) 



N 1 /l 



n=0 



r N+2 



(226) 



12 The additional requirement that Jg°° a n (d? j Pjg (djj) ^ij < oo is due to the fact that the integration interval is semi-infinite. 



54 



where the underlying auxiliary asymptotic sequence is {(1/snr)™, n = 1,2,...} and fci_B„ an d kiiB n are given by: 

1 



«UB„ 



dt n -i 



dt n -i 



(It,, 



(It,, 



8M(M - 1) 



tn ■ erfc 



to 



dto 



2M ' *° ' erfC ' 



(227) 



(228) 



We now establish the asymptotic expansions as snr — > oo of the lower and the upper bounds to the average value 
of the MMSE in ( 1220b and ( 122 li t, by using |29l Theorem 1.7.1], The asymptotic expansion with respect to the 
asymptotic sequence |snr~"} as snr — > oo of the sum of the integrals in (1220b and (122 lb is equal to the sum of the 
asymptotic expansions with respect to the asymptotic sequence {snr - "} as snr — > oo of the individual integrals in 
(12201 i and (1221b . because d2251 l and ( 1226b are asymptotic expansions of Poincare type. 

Consequently, the asymptotic expansions as snr — > 00 of the lower bound and the upper bound to the average 
value of the MMSE are given by: 



N MM 

mrnse LB (snr) = £ — — ■ fc LB „ +1 ■ J^&djj^ ) 



n=0 



N 



i=l j=l 



o 



snr 



N+3 



(229) 



/ 



mmse UB (snr) = ]T • k UBn+1 ■ 



n=0 



M M 



\ 



i=i i=i 



o 



J 



1 



snr 



N+3 



(230) 



where the underlying auxiliary asymptotic sequence is {(l/snr) n+1 , n = 1,2,...} and fc|_B„ and fcuB„ are also 
given by (1227b and (1228b . respectively. 

The values of fc|_B„ an d ^ub„ can also be computed by using 



erfc I 1/^ 



00 e -^{x 2 +l) 



-dx 



in (1227] ) and ( 12281 , so that: 

2 r °° 



"LB„ 



1 



n • 4 



ra+1 



7T 7 8M (M - 1) ( x 2 + X ) 
2 f 00 1 n-4" +1 



1 



•4" +1 T(n + 3/2) 



«UB„ 



8M(M-1) T(n + l/2) 

1 n-4™ +1 T(n + 3/2) 



rdx 



2M 



r(n+l/2) 



(231) 



(232) 



(233) 



Wo 2M ( x 2 + l) n + 2 
where T (•) is the Gamma function. 

Finally, since the functions pjj" (dy) are continuous and integrable on [0,oo) for arbitrarily large n and d2221 i 

ij 

also holds for arbitrarily large n, it is possible to let TV go to infinity in the asymptotic expansions of the lower 
and upper bounds to the average value of the MMSE in d2291 l and ( 1230b . respectively. 
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Appendix D 
Proof of Lemma|4] 

The lower and upper bounds to the average value of the mutual information are obtained from the upper and 
lower bounds to the average value of the MMSE, respectively, by using (l27l i as follows: 



Ilb (snr) = log I 



2ub(£)g?? 



Iub (snr) = logM - / mmseLB(£)d£ 



(234) 
(235) 

We now obtain the asymptotic expansions (as snr — > oo) of the lower bound and the upper bound to the average 
value of the mutual information directly from the asymptotic expansions (as snr — > oo) of the upper bound and 
the lower bound to the average value of the MMSE, respectively, by capitalizing on l29l Theorem 1.7.6]0 This 
Theorem requires that: 



/•oo 

/ mmseLB(C)^ < 00 

J snr 

/>oo 

/ mmseuB(C)^ < 00 

J snr 



and 



nr Z 



—rrd£ < oo, n = 1, 2, . . 

71+1 ^ 1 



(236) 
(237) 

(238) 



It is immediate to show (e.g. by substituting (1220b and (122 U in d2361 l and d2371 i, respectively, and trivially bounding 
the value of the integrals) that the integrals (I236l l and (1237b exist for snr > and, likewise, that the integral (12381 ) 
also exists for snr > 0. Consequently, the asymptotic expansion as snr — > oo of the lower and upper bounds to the 
average value of the mutual information are given by: 



N 1 

| LB (snr) = logM-£— — ■ k' LBn+i 



( 



n=0 



M M 



\ 



i=l 3 = 1 



o 



1 



snr 



N+2 



J 



(239) 



N 1 

Tub (snr) = log M - ]T • /c UBn+1 



M M 



i=l j=l 



J 



o 



1 



r N+2 



(240) 



where the underlying asymptotic sequence is {(l/snr) n , n = 1, 2, . . . | and k' LB and k[j B are given by: 



in+l 



LB,, 



CUB,, 



T(n + 3/2) 
2M ' T(n + 1/2) 



(241) 



13 Note that we determine the asymptotic expansions of the lower and upper bounds to the average mutual information directly from the 
asymptotic expansions of the upper and lower bounds to the average MMSE in conjunction with 427i . Alternatively, we can also determine 
asymptotic expansions of the lower and upper bounds to the average mutual information by adopting a procedure identical to that in the average 
MMSE case leveraging the bounds to the mutual information in Lemma [2] 
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y 1 i- 1 4 " +1 r(n + 3/2) 

fc UB „ = - ■ fcLB„ = 8M(M _ 1} • ^ • r(n+1/2) (2*2) 

We can also let go to infinity in the asymptotic expansions of the lower and upper bounds to the average 
mutual information in (1239b and (12401 i. respectively. 

Appendix E 
Proofs of Theorems[T]and|2] 

The proofs capitalize on the asymptotic expansions of the upper and lower bounds to the average value of the 
MMSE and the average value of the mutual information in Lemmas [3] and |4] respectively, given by: 

MM / 1 \ 

mmse L B(snr) = k LBd -^^^^(O) ■ +0 (243) 

i=l 3=1 " V 7 

MM / 1 \ 

mrnse UB (snr) = fc UBd • EE^"^ ) ' + ° (244) 



and 



,1 3=1 



MM / 1 \ 

l LB (snr) = log M - fc^ Bd • ]T ]T ^ (0) • — + — ^ (245) 



t=l 3=1 



MM / 1 \ 

l UB (snr) = log M - fc' UBd . g E^" ( ° } ' inH + ° (snr^+i j (246) 



i=l 3 = 1 



Let us define the functions: 



and 



as well as the functions: 



/ (snr) = snr d+1 • mrnse(snr) (247) 



3 (snr) = snr d • (log M - I (snr)) (248) 



/i (snr) = < 



/ (snr) , fc LBd ■ E&=ip2 -1) (0) < / (snr) < fc UBd ■ E^i^ _1) (0) 

fc LBd • E«=ipg _1) (0), /(snr) < fc LBd • E&=iJ# _1) (0) (249) 

i^3 'J i^j 'J 

fcu Bd ■ E&=U>2 _1) (0), /(snr) > fc UBd • ESj=ipS _1) (0) 



/(snr) - fc UB<1 • Eifj=iPd2 /( snr ) ^ fc UB d • E^=iP d 2 ^(0) 

/ 2 (snr) = <( ijtj i& ij (250) 

0, otherwise 

-/(snr) + fc LBd • E&=U# -1) (0), /(snr) < fc LBd ■ E&=U>2 _1) (0) 

/ 3 (snr) = { ift ift ij (251) 

0, otherwise 
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and the functions: 



5 (snr) , 

n (snr) = { ^uB d ' Eifj=iPd|~ (°)' 



fcue d • E^ipSa" 13 ^) < 5(snr) < fc^ Bd • E^=i pg^ (0) 



92 (snr) 



5 (snr) - fc[ Bd • E%=ip2 _1) (°). 5 (snr) > k' LBd ■ E^=i^H(0) 



1 

j=i 

M (d-1), 

i¥=3 

otherwise 



(252) 



(253) 



and 



33 (snr) 



- g (sm) + k' UB<) -Ym-iP^Ho), 3(snr) < ^ • E%=ipS _1) (0) 



0. 



otherwise 



(254) 



In view of the asymptotic expansions in ( 12431 ), ( 1244b . ( 1245t and ( |246t . it is possible to bound / (snr) and g (snr) 
for snr > snro, where snro is a sufficiently high value of snr, as follows: 



M M 



(255) 



and 



M M 



i=l 3=1 



M M 



fc ue d • EE^ _1) (°) - <B ' ^ < (snr) < fc^ Bd • EE^W + c'ub ■ ^ 
— , * u snr — — u snr 



(256) 



i=l j=l 



i=l 3 = 1 



where clb, cub, c' lb and c' UB are positive constants. Note thus that f 2 (snr), fa (snr), g 2 (snr) and 53 (snr) can be 
bounded as follows: 



I/2 (snr)| = f 2 (snr) < c UB 
I/3 (snr)| = / 3 (snr) < c L b 
\92 (snr)| = 52 (snr) < c' UB 
1 93 (snr) I = 53 (snr) < c' LB 



1 


= 


f 1 


snr 




^snr 


1 


= 




snr 




v snr 


1 


= 


( 1 


snr 




^snr 


1 


= °( 




snr 




1 snr 



(257) 



(258) 



(259) 



(260) 



Note also that f\ (snr) and g\ (snr) are piecewise infinitely differentiable because / (snr) and g (snr) are infinitely 
differentiable and 



fc LBd • E Pd^ 1} (°) ^ h (snr) < fc UBd ■ E P*"^ ) 



(261) 



58 



and 



fc u Bd • E pS _1) (°) < .91 ( snr ) < fc L Bd • E ^ 

i,j=l ij = l 

Therefore, Theorems [1] and [2] follow immediately by noting that: 



and 



/ (snr) = /i (snr) + / 2 (snr) - f a (snr) = h (snr) + £>( — ) (263) 



.g (snr) = gi (snr) + g 2 (snr) - # 3 (snr) = g 1 (snr) + £> ( — ) (264) 



and, in view of ( 12631 and (|264t together with (l44l . d45l d46l > and d47b , 

limsup/ (snr) = limsup/i (snr) = and liminf / (snr) = liminf /i (snr) = e d (265) 

snr->oo snr->oo snr->oo snr-s-oo 

and 

limsup g (snr) = limsup g% (snr) = e' d and liminf g (snr) = liminf g\ (snr) = (266) 

snr^oo snr^oo snr-s-oo snr^oo 

Appendix F 
Proof of Lemma[5] 

We obtain the lower bound to the error probability by using a genie that supplies the receiver with the true input 
vector and any of the other input vectors with equal probability. Then, 

i M 

P e (snr; H) = Pr (e|H) = ^ E Pr ( e l x = x » H ) 

. MM 

^M(M^I)EE^(x^x J |x.x,H) 

v ; 1=1 j=l 

1 A^- 1 ( /d?(H)snr\ 
= Wi^lJ E E 5 ■ •* I ^ ^ » P., (snr; (H)) (267, 

where Pr (e|x = x,, H) is the probability of error when Xj is transmitted given the channel matrix and Pr (x, — > Xj |x = x,;, H) 
is the probability of choosing Xj over x, when Xj is transmitted given the channel matrix. The inequality is due to 
the use of a genie. 

We obtain the upper bound to the error probability by using the well-known union bound (31 ]. Then, 

1 M 

P e (snr; H) = Pr (e|H) = 77 E Pr ( e l x = x - H ) 



M 

i=i 



^ M M 



1=1 3=1 
M M 



M 2 

i=i j=i 



(268) 
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where Pr (e|x = X,, H) is the probability of error when x* is transmitted given the channel matrix and Pr (x, — > Xj |x = 
is the probability of choosing Xj over x, when Xj is transmitted given the channel matrix. The inequality is due to 
the union bound. 

Appendix G 
Proof of Lemma[6] 

We follow the previous procedure to determine the asymptotic expansions of the lower and upper bounds to the 
average error probability. We obtain the lower and upper bounds to the average value of the error probability from 
the lower and upper bounds to the error probability by averaging over the fading statistics. The bounds are given 
by: 

M M 



P eLB (snr) =E H {P eLB (snr;H)} =]T]T / 

l= i ,=i J o 



1 



3 = 



2M(M - 1) 



erfc 



' djjsnr 



Pd 2 (d 2 ) ddl (269) 



P euB (snr) =E H {Pe LB (snr;H)}=]r^ / 2M ' erfc 



i=l j=l 



' d 2 snr 



(270) 



We obtain the asymptotic expansions as snr — > oo of the lower and upper bounds to the average value of the 
error probability by capitalizing on l29l Theorem 3.2] and 11291 Theorem 1.7.1]. The asymptotic expansions are 
given by: 



P eLB (snr) = 



N 1 

snr n+l 



/ 



M M 
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LB,, 



N 



Pe UB (snr) = — ^ ■ k'^ Bn+1 ■ 
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EE4 )(0) 
/.. .. \ 



n=0 



M M 



o 
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1 



snr 



N+2 
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snr 



N+2 



J 



(271) 



(272) 



where the underlying auxiliary asymptotic sequence is {(1/snr)™, n = 1,2,...} and k'^ B and fcy B are given by: 



"LB„ 



dt n -i 



fit,, 



1 



k" 

K UB„ — 



dtn-1 



dt r , 



1 2M(M-1) 

OO -i 



erfc ( y J ) 



tn-l 



2M 



erfc 



dt 



(273) 



(274) 



The values of k" B and fc(j B can also be computed by using 

erfc 



x 2 + 1 



-dx 



(275) 



in (12731 ) and ( 12741 , so that: 



Hb„ 



4" 



(M-l) (z 2 + l) 



Tt + l 



dx 



1 4™ r(n+l/2) 



2M(M-1) r(n + l) 



(276) 



R UB — ' 



M (3.2 + J 



.71+1 



_ i 4™ r (?i + i/2) 

2M ' ' r(n + l) 



(277) 
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where T (•) is the Gamma function. These expansions hold, once again, for arbitrarily large N. 

It is also simple to verify the conditions that justify the application of ||29l Theorem 3.2] and |29l Theorem 1.7.1]. 
For the asymptotic expansion of the individual integrals in ( 12691 ) and ( 1270b . it can be shown that 

< an(t) ■ <f> n (X), n = l,2,... (278) 

with a n {t) = 4™ and (f>„ (A) = 1. Furthermore, by assumption, the functions (dj?) ,n = 0, 1, . . . , are continuous 
and integrable on [0, oo) and 

/>oo 

/ o„(d2)pW(dg)ddg<oo (279) 
Jo 

For the asymptotic expansion of the lower and upper bounds to the average value of the error probability, we use 
the fact that the asymptotic expansions are of Poincare type to write the asymptotic expansion of the sum as the 
sum of the asymptotic expansions. 
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